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ABSTRACT 



In order to assist administrators of long-term 
facilities in responding to the changing environment 
of and demand for institutional care of the elderly 
and chronically infirm, a unified approach to patient 
classification and nurse staffing has been developed. 

The approach is essentially structure oriented and is 
directed toward the presentation of alternative nurse 
staffing strategies; its basis is the quantification of 
patient nursing needs through intermediate steps of 
assessment and classification. The classif ication system, 
in turn, has been related to the demand for nursing 
services, as categorized by care area, for each of 
three homogeneous patient groupings. Effective allo- 
cation and assignment of patient nursing care activities 
have been modeled by use of techniques of mathematical 
programming. 

The ability to predict, with reasonable confi- 
dence, the level of care to which a patient would 
most appropriately be assigned based on an assessment 
of various functioning status items, psycho-social 
status indicators, and medically defined conditions 
was considered initially. Application of an existing 
nonlinear multiple regression technique to patient 
assessment data drawn from a comprehensive assessment 
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instrument yielded a means by which patients could be 
classified according to 37 health status indicators. 

The regression method was chosen for its specific 
applicability to problems of fitting a polychotomous , 
ordered response, analogous to the level of care. In 
order to achieve a classification system of reasonable 
size for use in typical long-term care facilities, 
various subsets of the original 37 variables were 
tested for their prediction and recognition powers ; 
a subset of 12 variables was shown to possess adequate 
capabilities in this regard. This subset was synthesized 
into an implementable patient classification system 
procedure . 

The determination of an optimal staff mix,- 
allocation of nursing time, and assignment of nurses 
to patient care demands was captured in the Basic 
Staffing Model (BSM) , a mixed-integer linear program. 

The BSM maximizes an objective function that combines 
the concepts of the appropriate assignment of nursing 
personnel to specific patient care activities and 
the satisfaction of high priority demands at greater 
than minimum levels, if possible. Model constraints 
include the availability of nursing resources, budgetary 
limitations, legal staffing requirements, and adherence 
to bounded representations of patient nursing care 
activities. Two extensions to the BSM, Model I and 
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Model II, were developed to facilitate the exploration 
of alternative staffing policies. Model I allows for 
the parametric representation of changes in the total 
patient-centered services provided and personnel budget, 
while Model II enables parametric alterations in upper 
and lower bounds on staff and patient demands to be 
examined . 

Two algorithms were derived for the efficient 
solution of both BSM extensions. Recognizing the forms 
of the models, branch and bound methods incorporating 
post-optimality analysis in specialized linear program- 
ming routines were developed. These methods were based 
on the Upper Bounded Dual Simplex algorithm of Wagner; 
their computational characteristics for both general 
parametric mixed-integer linear programs and specific 
examples of applications to nurse staffing problems 
were investigated. 
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Chapter I. Introduction: The Need for Research 

1.1 General Discussion 

As the twenty-first century approaches, one of 
the most important components of our multi-billion 
dollar a year national health system, long-term care, 
seems certain to become even more vital. With age being 
the primary, although not exclusive, determinant of entry 
into the long-term care system, demographic projections 
indicate that potential demands will be high. Brody 
[12] notes that while at the turn of the century some 
three million persons (4% of the U.S. population) were 
age 65 or over, we now have approximately 20 million 
persons (10%) in this category. Figures cited by Flagle 
[24] show a 55% increase by 1990 in persons 65 or over, 
based on 1970 census data for the state of Maryland. 

The United Nations estimates that for the period 197 0- 
1980 the world's aged population will grow by 38% while 
the total population increases by 28%. 

The response of Federal and state governments to 
this trend has for the most part been in the form of 
specific provisions in the current Medicare/Medicaid 
legislation.* These programs provide support or 

* Titles XVIII and XIX of the Social Security Act. 
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supplementary assistance to those persons in need of 
long-term care and who meet certain eligibility criteria. 
Still being debated is proposed legislation establishing 
a National Health Insurance (NHI) system which presum- 
ably will contain payment structures for some forms of 
care of the chronically ill. Flagle [24] notes that the 
five-year period following the installation of Medicare 
saw a 68% increase in nursing home patients. If this is 
any indication of the underlying elasticity of demand for 
long-term care service, the potential impact of NHI could 
be highly significant, especially if payments are weighted 
towards institutional care rather than possible alterna- 
tive care programs. The relative magnitude of this impact 
can be estimated by noting figures prepared by the Depart- 
ment of Health, Education, and Welfare (HEW) [75] showing 
that there are currently some 16,000 long-term care 
facilities in the U.S. caring for about 1.1 million per- 
sons, representing approximately 5.2% of the population 
over 65. 

Because of the large amount of federal funds 
being devoted to health care in general, and with the 
probable expenditure of much larger sums in the future, 
the need for accountability and cost-benefit analysis has 
become evident. The legislative reaction has been the 
creation of Professional Standards Review Organizations 
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(PSRO) on a state-wide basis to set standards and cri- 
teria for the various component specialties of the 
health delivery system, and to monitor compliance with 
such guidelines. The enabling legislation* specifically 
calls for such standards review organizations in the 
long-term care area, although to date little progress 
has been made towards complying with this mandate. It 
is certain, however, that the decisions ultimately made 
will bear heavily on both the forms of long-term care 
delivery and the quality of care provided. 

The potential impact of population increases, cur- 
rent and proposed publicly-supported health insurance 
systems, and PSRO taken both individually and in concert 
have given rise to problems that must be dealt with by 
national and regional policy-makers. It is incumbent 
upon them to predict the resource requirements generated 
by these factors in a number of alternative care settings 
and to choose the mix of services which will most ade- 
quately serve the client population. Assessment of the 
social and technological requirements for prolonging 
patient independence in home care settings must be carried 
out and weighed against the alternative of institution- 
alization. Secondary effects such as those to be felt 



* 1972 Amendments to the Social Security Act. 
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in the labor and education markets for health profes- 
sionals must be recognized. Where increased utilization 
of institutions is projected, plans must be made to in- 
crease existing capacity either by direct government 
involvement or through incentives to the private sector. 
In short, there is much work to be done on the "macro" 
planning level before the predictions become reality. 

1.2 The Need for Research at the Facility Level. 

It should be obvious at this point that regard- 
less of the increasing emphasis on alternative settings 
for long-term care, the future portends greater require- 
ments for quantity and quality of care to be provided by 
facilities. Being licensees of their respective state 
governments, with additional federal certification re- 
quired in order to care for Medicare patients, individu- 
al facilities must be capable of reacting in a positive 
way to the requirements set forth by regulatory agencies 
In doing so, they will in a sense be translating to a 
"micro" level those broad objectives mentioned earlier. 
This dissertation represents an effort towards assisting 
the individual facility in this task. 

As an historical note the reliance upon an 
institutional setting for care of the aged and chronical 
ly infirm is a relatively recent phenomenon. Brody [12] 
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identifies six distinct periods in the history of long- 
term care in the U.S. The colonial period and the period 
of the 19th and early 20th centuries to 1920 were charac- 
terized by care provided in the home, with relatively 
little dependence on locally-financed public almshouses. 
Between 1920 and 1935 the rise of the eleomosynary insti- 
tution and the creation of publicly-financed care systems 
took place, after which the years 1935-1945 constituted 
a time of reaction to such developments. The postwar 
years to 1965 saw the increased entry of privately-owned 
proprietary nursing homes into the LTC system, while con- 
currently, the major mechanisms for public and private 
support were being molded. The advent of the current 
period is marked by the enactment of Titles XVIII (Medi- 
care) and XIX (Medicaid) of the Social Security Act of 
1965, from which the concepts of a stratified LTC system 
responsive to patient needs are derived. 

Since we shall later have cause to quantify the 
relationships between patient health status and the 
levels of care of the stratified system, we pause to 
clarify some definitions. Because most of this study 
was conducted in the State of Maryland, definitions of 
LTC facilities are provided below as taken from that 
State's Report of the Governor's Commission on Nursing 
Homes [56, p. 117]; roughly analogous interpretations 
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are given in other sources (cf. Coggeshall [16]). Note 
that to a large extent, the levels of care correspond 
to the type of facility involved, and we shall therefore 
use the terminology interchangeably.* 

Chronic Disease Hospital : A facility that provides 

medical care to persons incapacitated by illness, 
with the intent and, indeed, mandate to rehabili- 
tate and return them to the community. 

Skilled Nursing Home : A facility that maintains facili- 

ties and staff necessary to render skilled nurs- 
ing care (which means by an RN or by an LPN under 
supervision) . 

Intermediate A Facility ; A facility that provides long- 
term care to residents whose illness is not acute 
and whose proper care requires no more than 8 
hours per day under supervision of an RN or LPN. 
Intermediate B Facility : A facility that provides 

services which a person normally provides him- 
self, but for which he is now dependent upon 
others because of advanced age, physical, or 
mental limitations. 

Current practice calls for the physical placement of a 

* More detailed discussion of specific regulations per- 
taining to levels of care will be provided later. 
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long-term care patient into a facility certified to pro- 
vide that level of care which is deemed appropriate for 
his or her needs. A change in patient requirements thus 
necessitates movement from one facility to another, 
although recent trends indicate an increase in facilities 
with multi-level certifications, thereby eliminating this 
requirement. 

With this background information in mind, we must 
necessarily consider the task of the facility administra- 
tor, upon whom the responsibility for compliance with 
regulatory agencies rests. In broad terms, the adminis- 
trator must be concerned with the adequate and timely pro- 
vision of the three components of institutional life as 
identified by Brody [12] : 

1. Basic maintenance services (shelter, food, 
sanitation) ; 

2 . Medical and paramedical services to foster 
maximum physical health and functioning 
capacities; 

3. Psychosocial components. 

The second and third of these components are highly 
labor-intensive, and their provision accounts for the 
major portion of LTC facility operating costs. Estimates 
provided by the Governor's Commission [56], Schwartz 
[63] , and HEW [75] put total labor costs at about 
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55%-60% of total operating budgets. The above studies 
indicate that payroll and fringe benefit costs attribut- 
able to nurse staffing make up anywhere from 35% to 50% 
of operating costs. In short, the administrator is con- 
fronted with a budgeting problem which is certain to in- 
volve relatively large sums to be spent on nursing staff. 

Cost, however, is neither the only nor the most 
important consideration in developing a nurse staffing 
program. Aydelotte [5] points out five additional fac- 
tors which must be weighed and acted upon in order that 
an acceptable program may be obtained: 

1. Quality of patient care to be delivered and 
its measurement; 

2. Characteristics of the patients and their 
care requirements; 

3. Prediction of the supply of nursing manpower 
required for (1) and (2); 

4. Logistics of the staffing program and its 
control ; 

5. Evaluation of the quality of care desired, 
thereby measuring the success of the staffing 
itself. 

Item (2) is perhaps the key factor among all five, since 
unless a reasonably accurate appraisal of the care needs 
of facility residents can be made, the staffing mix and 
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staffing patterns developed in concert with budget con- 
siderations will be at least partially ineffective, with 
a consequent degradation in quality of care. 

What means, then, are available to administrators 
and their directors of nursing for relating patient 
health status and the implied care needs to a nurse staff- 
ing system? Relying upon the placement status of the 
patient as an assessment tool allows the use of minimum 
staffing levels as specified by state regulations, for 
example, as a guide. The number of hours of bedside care 
per patient of each level, staf f-to-patient ratios, and 
supervisory personnel required could be determined for the 
patient mix that exists, and the minimum or near minimum 
resources maintained. if further funds are available, 
some choice could be made as to additional personnel re- 
quired. Patient placement status, however, may not neces- 
sarily be an accurate reflection of demand for nursing 
services, since many other intervening factors are in- 
volved in the placement decision. Information generated 
in the course of utilization review might give a more 
accurate picture of patient care needs, but relating such 
information to nursing requirements and hence to budgeting 
is likely to be difficult and to occur in an untimely 
fashion . 

The need for maximization of care quality for a 
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given budgetary level, and the relationship between 
patient health status and nurse staffing are thus central 
issues confronting the administrator. The evaluation of 
quality of care suggested in item (5) above is a most 
difficult task. What can be done at present is to either 
assume quality is acceptable with the staffing pattern 
employed or to rely on some appropriate measure of quali- 
ty surrogates. In the former case, a staffing pattern 
which contributes to job satisfaction by assigning nurses 
to tasks ‘commensurate with their training can lead to 
better staff morale. Revans [57,58] has demonstrated 
that staff morale is a good indicator of the quality of 
care provided. In connection with the use of quality of 
care surrogates. Burroughs [14] showed that a number of 
easily measured indicators (e.g., number of requests for 
pain medication, number of incontinent patients, length 
of stay, etc.) could be quantitatively related to subjec- 
tive estimates of quality of care by use of discriminant 
analysis. These studies were conducted in acute care 
facilities, however, and additional analysis is needed 
in the LTC area . 

1.3 The Proposed Research 

In order to provide administrators and nursing 
directors, in their roles as decision makers, with useful 
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tools for assessing the relative merit of alternative 
courses of action, a unified approach is proposed. This 
methodology is based on a multi-disciplinary operations 
research point of view and will address the following 
four issues: 

1. Budgetary restrictions; 

2. Assessment of patient care needs; 

3. Nurse staff allocation; and 

4. Quality of care. 

It will be shown that a key factor in such an approach is 
relating patient health status, through classification, 
with nursing care requirements. 

After an initial review of the existing litera- 
ture pertaining to the general problems of patient classi 
fication and nurse staffing, a classification system for 
long-term care patients into levels of care will be de- 
veloped. The system will initially take as its input in- 
formation available from a comprehensive patient assess- 
ment instrument. It will be shown that a reasonably 
accurate level of care classification can be derived from 
components of the assessment instrument by application of 
a regression-based statistical technique. A proposed 
classification instrument designed for easy use by facili 
ty staff will be demonstrated. 

We shall then forge the link between patient 
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assessment and classification, and demand for nursing 
services, using information thereby obtained as inputs 
into a mathematical programming model designed to syn- 
thesize the four elements mentioned above. Certain theo- 
retical properties of the model form will be advanced to 
better allow for examination of alternatives. Such ad- 
vances are made in the area of parametric mixed-integer 
linear programming. 

The proposed staffing model takes as its objec- 
tive to be maximized the allocation of nursing care activi- 
ties to the most appropriate nursing skill level avail- 
able and a priority measure on care area/classification com- 
binations. By invoking such a discipline, good quality care 
should be achievable. In order to assure that the model 
is computationally tractable, a grouping of the nursing 
tasks into various "care areas" will be utilized. Atten- 
tion is given to the varying needs of patients for the 
tasks of each care area by basing the model on a subdivi- 
sion of the care areas by the levels of care indicated 
through patient classification. Constraints on personnel 
budget and availability, legal staffing minimums, and 
unit aggregate demands by care area and patient classifi- 
cation are also included. In addition, the latter of 
these three constraint sets includes a priority related 
system for meeting demands. 
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Through the use of mathematical programming 
methods, staffing levels and skill level mix as well as 
allocation of direct care demands by skill level will be 
examined under a variety of circumstances. An example 
of how such information may be generated and presented 
will be provided. Finally, we will conclude by noting 
difficulties encountered in the course of the research, 
conclusions reached , and recommendations for further 
study . 



Chapter II. A Review of Existing Methodologies for 
Patient Classification and Nursing 
Resource Allocation 

II. 1 Introduction 

In order to place the current study into proper 
perspective, we briefly review some of the major studies 
most closely related to our twin objectives of patient 
classification and nursing resource allocation. The sum- 
maries presented are in no way meant to be an exhaustive 
survey of preceding work; rather, they represent an 
effort to capture the essence of the extensive body of 
significant literature in the field. 

Studies involving classification, and the con- 
comitant nurse staffing problems of allocation and assign 
ment in an acute care setting will be discussed first, 
since historically they provide the conceptual basis for 
subsequent work in long-term care. An examination of 
work specifically devoted to this latter area will follow 
Further references and amplification of earlier research 
may be found in the surveys conducted by Aydelotte [5] , 
Stimson and Stimson [71], and Shuman and Wolfe [67]. 
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II. 2 Patient Classification in an Acute Care Setting 
Connor [18] 

It can be argued that most patient classification 
procedures in use today are based on the fundamental con- 
cepts proposed by Connor in his pioneering studies of 
nursing unit staffing requirements. Essentially, the aim 
of his study was to determine the underlying functional 
relationship between patient care needs and the response 
to these needs in terms of nursing manpower. If patients 
can be assessed and their care needs anticipated, a more 
flexible staffing pattern is possible than the commonly 
used ratios based primarily on census counts. As an ini- 
tial step the amount of direct care provided to a sample 
patient population was measured. It was found that the 
time devoted to direct care for individual patients by 
the nursing staff varied with the degree of self- 
sufficiency of the patient, i.e., the more self-sufficient 
the patient, the less direct care he received. Moreover, 
it was found that the patient population could be classi- 
fied into essentially three groups: 1) a self-care group 

at one extreme consisting of patients who were able to 
take care of most of their needs and made little or no 
demand on nursing staff, 2) a total-care group at the 
other extreme consisting of patients who were often seri- 
ously ill or bedfast and made large demands on nursing 
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staff for such care activities as feeding, bathing, etc., 
and 3) an intermediate care group who required only par- 
tial assistance from the nursing staff. The three group- 
ings were verified by the emergence of a tri-modal dis- 
tribution of measured direct care times, as shown in 
Figure II. 1 

A classification scheme was developed, using a 
Boolean logic, that classified patients into self, par- 
tial, and total care categories on the basis of those 
factors that reflected their degree of self-sufficiency. 
Indeed, it was these factors that correlated with the 
large demands made on nursing staff in the provision of 
direct care time. The basic classification scheme, and 
the factors involved, is shown in Figure II. 2. 

Connor then determined the average number of 
hours of direct care devoted to patients in each of the 
three patient categories by noting the group averages 
for the three populations as shown in Figure II. 1. After 
determining the average number of hours devoted to in- 
direct care by additional work sampling on typical 
medical-surgical units, Connor proposed that the total 
nursing hours for a given patient mix on an acute care 
unit could be predicated by first computing a Direct Care 
Index 

= .5N. + 1.0N 2 + 2.5N 3 
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(II. 1) 
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Figure II. 1 



Distribution of Minutes of Direct Patient 
Rendered as Found by Connor 



Care 
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Figure II. 2 

Connor's Combinations of Factors for 
Categorization of Patients 



0©ry I. Qetf -tore 

Any o7 the following comblnollons checked 
(o) Ambulatory, or up ' In choir — Self (without 
assistance) 

Feeding self, or requires food cut 
Etalhing In bolhroom, or ot bedside — Partial 
Self (con bathe self except for bock end per- 
haps extremities) 

(b) Ambulatory — with assistance 
Up In choir — Self 

Bathing In bathroom, or at bedside — Partial 

Self 

(cl As In (a and b) with 
Vision Inadequate 
Oxygen theropy 
I.V. feeding 

buf no two of theso factors simultaneously 



Cofogery It, Portlet «r lnt9m«c!ts>o cere 



Any of the following combinations checked 
(a) Ambulatory — with assistance 

Bathing In bathroom, or at bedside — Partial 
Self 



Feeding — Complete Assistance (except I.V. 
feeding) 



Vision Inodequote 
Oxygen iheropy 



} optional (does not of- 
fect classification under 
these conditions) 



(b) Up In chair — 5o!f 

Bathing at bodsldo — Complata Assistance 
Feeding Saif, or requires Tood cu? or I.V. 
feeding 

Oxygen theropy 
Vision Inadequate 

(c) As In |b) with th* following change® 

Up In chair — V/ifh Assistance 

Bath at bedside 

(d) Up In choir — With Acslslonco 

Both ot bedside — Partial Self 

Feeding — Complete Assistance 

Vision inodequeta 1 
^ , >opllonal 

Oxygen therapy J 

(e) Bath ot bedside 

Feeding — Self, or requires food cut, or I.V. 
feeding 

Vision Inadequate 1 , 

_ , Voptloncl 

Oxygen theropy J 

(!) Being Speclaied of Necessity (patient has 
continuous nursing assistance to the exfont 
♦hot meal relief must be provided for fpeclal 
duty nurse) 

Norr. A*f potion} wfi4 efhorwfir foMi ln»o Col* gorln f of II. bul <*•>« 

ll vndor iwtMod »h*ropr Of It Iff liKOnllntni llntludln $ wound 

dtoinaiio (boffgo ol bod Itoont, of fwo'ltd’y tmollmnalty 

dhlurb*d 1r>*mdt o tfooil tontlonl obio/voHon, In linjli loom, rrcolcl 
dlttutbont*! will bo dropptd to tb« n**l toltgoff. 

C mtmgpry ttt. Intensive, or "tetol", core 

All combinations not previously mentioned. 



I optional 
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where 

N ^ = number of patients classified as Category I. 
^ = number of patients classified as 
Category II, 

= number of patients classified as 
Category III, 

and the coefficients of the equation represent the care 
hours required for each category. The total work load 
can then be found from 

WL =1+20 hours (II. 2) 

where the additional factor of 20 hours represents the 
average time required for all other indirect care activi- 
ties. In a subsequent extension, a method for "controlled 
variable staffing" based on predictions from the above 
indexes was demonstrated by noting that the daily aver- 
ages and shortfalls in nursing time demanded, relative to 
the unit's basic staff, could be controlled through use of 
additional float personnel. Young [83] and Wolfe and 
Young [81] provide further insights on the practical 
aspects of the Connor staffing approach in their discus- 
sion of day-to-day implementation procedures. Although 
essentially sound and more effective than traditional 
techniques used for staffing, the method has drawbacks in 
that it neither indicates the nursing skill levels re- 
quired nor suggests how available time might be allocated 
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between patients of various categories. 

Young [84] 

It might be anticipated that a patient classifi- 
cation system, such as that proposed by Connor, might 
not be universally applicable and should therefore be 
applied with some caution. Young, in a study of the 
direct care provided to pediatric patients found that, 
although pediatric patients can be classified into age 
groupings, no classification as to degree of self- 
sufficiency within such groupings was possible. Indeed, 
as opposed to adult patients, the more self-sufficient a 
pediatric patient became, the greater the demand on nurs- 
ing time. Examination of data on direct care nursing time 
revealed a continuous distribution of times with a mean 
of about 90 minutes; i.e., the frequency distribution of 
direct care times was unimodal indicating no distinct 
subpopulations. In order to determine total minutes of 
direct care required it was felt that a multiple linear 
regression model was most appropriate, using as the re- 
gressors the 0-1 variables corresponding to the subcate- 
gorization of 12 factors determined by professional 
nurses. These are shown in Table II. 1. 

The numbers in the right-hand column are not to 
be interpreted as time required for the factors indicated; 
they are, in fact, regression coefficients. However, for 
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Table II. 1 

Young’s Classification of Pediatric Patients 

Regression Coefficient 
for Determining 

FACTOR DEGREE Direct Care Times 

MOBILITY: Complete Self 1 

With Assistance 10 

Bedfast 25 

ACTIVITY: Restrained 20 

Hyperactive 10 

Emotionally Disturbed 10 

Normal Activity 10 

Lethargic 1 

Unconscious 5 

FEEDING: Bottles x 6 5 

x 8 10 

x 12 15 

Feed Complete Assistance 25 

Feed Partial Assistance 5 

Feed Gavage 5 

Self 1 

I.V.: Normal (Simple) 1 

Special (Complicated) 2 

MED. FREQ.: Average 1 

Frequently 20 

TREATMENT: Oxygen 20 

Dressing-Compresses 50 

Suction 20 

Tracheotomy 20 

Drainage 15 

Croupette 15 

Pediatric Procedures 20 

INCONTINENT: Diapers Frequently (10,20,30 per day) 1 

Bed Change 10 

TPR: Normal Frequency 1 

Special 10 

ISOLATION: Protective 3 

Preventive 3 

FAMILY Assist -10 

MEMBERS: Non-Assist 1 

Special Attention Given 2 

SPECIAL OF Special Nurse 2 

NEC: Relief Only 1 

BATHING: Partial Assistance 10 

Complete Assistance 15 



Add constant of 40 to total 
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those factors checked for a particular patient, the sum 
of the numbers, when added to a constant of 40 does pro- 
vide the total direct care time, in minutes, required for 
that patient. Young found a very high correlation be- 
tween these predicted times and observed times. More 
significantly, it was again found that patient assessment 
and, in this instance, "continuous" classification was 
much more predictive of patient care required than a 
census count and rule-of-thumb. 

Other Studies 

For the sake of completeness we briefly note 
some additional studies involving hospital inpatient 
classification. The National League of Nursing Education 
[48], Cloussen [15], Stanford [70], and the Hospital 
Association of New York State (HANYS) [30] all have pro- 
posed classification systems based on assessment of a 
patient's physical and emotional needs as well as the 
incidence and/or frequency of various medication or 
treatment regimens. Work by Poland, English, Thronton, 
and Owens [54] also involves the use of patient classi- 
fication; however, since the primary thrust of the study 
is staffing, we defer its consideration to the next 
section . 
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II. 3 Nurse Staffing in an Acute Care Setting 

Beginning with the concepts proposed by Connor, 
the study of the nurse staffing problem has intensified 
with an ever-increasing degree of sophistication in the 
models proposed. Typically, solutions were sought for 
the following three aspects of staffing as they relate to 
demand for service: 

1. Number and skill levels of nursing personnel 
needed ; 

2. The distribution of the time of available 
personnel among direct and indirect care 
activities (the "allocation" problem) ; 

3. The delegation, by skill level, of the con- 
stituent tasks of direct and indirect care 
(the "assignment" problem) . 

Each of the methods presented below represents an attempt 
to answer one or more of these questions. 

Wolfe [80], Wolfe and Young [81,82] 

In order to overcome some of the shortcomings of 
Connor's staffing models, Wolfe proposed a methodology 
based on the concept of a "multiple assignment model." 
This approach permits the assignment of a skill level to 
more than one activity as opposed to the classical 
assignment model which imposes one-to-one assignments. 
Wolfe viewed all nursing activities in both the direct 
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and indirect care categories as falling into one of 12 
mutually exclusive task complexes as shown in Table 
II. 2. A work sampling study was undertaken to determine 
the nursing time required for each complex during an 
eight-hour day. These times were related to patient con- 
dition by regressing them on the number of patients in 
each of Connor's three classifications on the unit. The 
resulting regression coefficients, b ^ , were interpreted 
as time required for the task complex per patient of 
category N^, i=l,2, or 3; certain coefficients were found 
to be insignificantly different from zero and thus con- 
stant for all or a subset of the classifications. 

A cost, c^ j , indicating both salary cost and im- 
puted monetary disutility incurred by assigning skill 
level i to task complex j, was then determined. The lat- 
ter cost component was obtained by application of psycho- 
metric measurement techniques to judgments solicited from 
both nurses and patients. Wolfe’s integer programming 
multiple assignment model is, therefore, 

MIN ^ (c. . — <b. )x. . + ^.a.^.Y. 
l] i] l l i] 1111 



s . t . 4- x . . . 

l i] = 1 , 



a . Y . - ^ b .x . • > n 
ii j ] i] 0 , 



* j 



V 1 



(II. 3) 



x . . = 0 , 1 
13 

>0 and integer 
1 i 
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Table II. 2 

Task Complexes Identified by Wolfe 



1 . 


Technical 


Tasks I 


7. 


Clerical 


Tasks 


I 


2. 


Technical 


Tasks II 


8. 


Clerical 


Tasks 


II 


3. 


Evaluation of Pt. 
Needs and Assign- 


9. 


Clerical 


Tasks 


III 



ment 

4. Supervising and 10. Housekeeping Duties 
Teaching 

5. Preparatory Care I 11. Escorting and Emergency 

Errands 



6 . Preparatory Care 
II 



12. Maintenance, Checking, and 
Ordering of Supplies and 
Equipment 
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where 

<<^ is the hourly wage of skill level i, 

a^ is the number of hours worked by each member 
of skill level i (Wolfe assumed a^ = 8) , 
is the assignment variable, a value of 1 
indicating the assignment of skill level i to 
task complex j, and 

is a variable indicating the number of person- 
nel of skill level i required. 

Closer examination of (II. 3) reveals that the objective 
function calls for the minimization of an assignment cost 
plus a non-productivity cost, assuming the following: 

1. All task complexes must be assigned; 

2. Ample personnel of all skill levels are avail- 
able to meet the requirement Y^; 

3. No partial shifts are allowed; 

4. No splitting of task complexes between skill 
levels is allowed. 

Probably because of these assumptions, Wolfe claims not to 
have touched upon the allocation and assignment problems 
noted earlier, but rather to have provided a means for 
determining the number and skill levels of nurses required 
on a unit. This is the primary drawback of his methodology. 
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Singer [68] 

Recognizing the stochastic nature of require- 
ments for nursing care as a function of a patient's con- 
dition, Singer attempted to improve the staffing aspects 
of Connor's work by modeling the fluctuations in the num- 
bers of patients in each of the three classifications in 
a time-dependent framework. He was able to do this by 
using a multiple-population, immigration-death process 
model, and provided good predictions of Connor's direct 
care index over relatively short horizons. The model 
did not perform as well for time periods greater than 
about three days. Unfortunately, few indications as to 
how these findings may be put into practice are given. 
Jelinek [32,33] 

In an effort to predict the implications of any 
number of several well known factors on some convenient 
measures of nursing activity, Jelinek proposed the use of 
a multiple linear regression model. A major result of 
such a methodology is isolation of effects through con- 
trolled experimental design. The author reports an appli- 
cation of this technique in regressing variables indicat- 
ing the number of hours per patient-day devoted to direct 
care, indirect care, and non-productive activities on a 
set of independent variables related to patient length of 
stay, health status (as per Connor's classification), type 
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of unit, unit census, and staff size and skill level mix. 
Among the findings reported were decreasing returns to 
scale for direct care activities as a function of avail- 
able staff, with a corresponding increase in non- 
productive activity. The findings also emphasized the 
poor predictive power of patient census relative to nurs- 
ing activity. 

Poland, English, Thornton, and Owens [54] 

The authors offer a normative model for predic- 
tion of demand for direct care by means of an attractive- 
ly designed system called PETO. Through work sampling, 
the relationships between various criterion levels of 
seven patient assessment factors and the number of hours 
of required "physical nursing care" were determined. 

These times were then translated into a point scale for 
each criterion level. Thus, a patient would be assessed 
in accordance with Table II. 3, then the point total con- 
verted to "physical care units" (PCU) by means of Table 
II. 4. Finally, the unit direct care work load can be 
found by adding all PCU's. Based on nursing care audits, 
the amount of required care delivered on sample units was 
found to be significantly increased after installation of 
the system. Unfortunately, however, PETO failed to indi- 
cate how staffing by skill level as well as allocation of 
resources may be achieved . 
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Element 



Diet 



Toileting 



Vital signs 
and 

measurements 



Table II. 3 

PETO Elements of Care 
Description 



Point Value 



Feeds se:lf without super- 
vision, or family or 
parent feeds patient. 1 

Feeds self with supervision 

of staff. 2 

Feeds self with constant 

presence of staff, or gas- 
trostomy feeding every 4 
hours. 4 

Total feeding by personnel 
or instructing the parent, 
or continuous I.V., or 
blood transfusion. 8 

Tube feedings more frequent- 
ly than every 4 hours. 12 

Toilets without supervision. 1 

Toilets with supervision, or 
specimen collection or 
uses bedpan. 2 

Up to toilet with standby 
supervision, or output 
measurement every hours, 



or daily colostomy irriga- 
tion. 4 

Incontinent, average output. 8 

Incontinent with diarrhea, 
or immediate postoperative 
colostomy, or urethrostomy 
or drainage with frequent 
dressing change. 12 

Routine — daily temperature, 

pulse, and respiration. 1 

Vital signs every 4 hours, 
or night check every 4 
hours. 2 

Vital signs monitored, or 
hypothermia , or vital 
signs every 2 hours. 

Vital signs and observation 
every hour, or vital signs 
monitored, plus 



4 
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Table II. 3, cont'd. 

Element Description Point Value 

hypothermia and 

neurological evaluation. 8 

Blood pressure, pulse, respi- 
ration, and neurological 
evaluation every 30 minutes. 12 

Respiratory Bedside humidifier, or blow 
aids bottle. 1 

Mist or humidified bassinet 
when sleeping, or cough and 
deep breathe every 2 hours, 
or IPPB without supervision 
every 4 hours. 2 

Continuous oxygen, or cough 
and deep breathe every hour, 
or continuous assisted venti- 
lation. 4 

Mechanical respiratory aid, or 
IPPB with supervision every 
4 hours. 8 

IPPB continuously with inter- 
mittent hand ventilation. 12 

Suction Routine postoperative standby. 1 

Nasopharyngeal or oral suction 

as needed 2 

Tracheostomy suction every 
hour, or nasogastric tube 
irrigation every 2 hours. 4 

Tracheostomy suction every 30 

minutes, patient responsive. 8 

Tracheostomy suction every 30 
minutes, patient not re- 
sponsive. 12 

Cleanliness Bathes self, bed straightened. 1 

Bathes self with help or 

supervision, daily change of 
bed . 2 

Bathed and dressed by person- 
nel, or partial bath given, 
daily change of linen. 4 

Bathed and dressed by person- 
nel, special skin care, 
occupied bed. 8 
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Element 



Table II. 3, cont'd. 
Description 



Point Value 



Turning and/ Up in chair with assistance 
or assisted once in 8 hours, 

activity Up in chair with assistance 

twice in 8 hours, or walk- 
ing with assistance 

Bedfast with assistance in 
turning every 2 hours, or 
up walking with assistance 
of two persons twice in 8 
hours . 

Bedfast with assistance in 
turning every hour. 

Turning on orthopedic frame 
every hour . 



1 

2 



4 

8 

12 



PETO Conversion 



Points 



4-11 

12-19 

20-27 

28-35 

36-43 

44-51 

52-59 

60-67 

68-75 

76-83 



Table II. 4 

of Points into 

Physical 
Care Units 
(PCUs) 



1 

2 

3 

4 

5 

6 

7 

8 
9 

10 



Hours of Care 



Hours of 
Nursing Care 



1 

2 

3 

4 

5 

6 

7 

8 
9 

10 
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Shuman [66] 

Although the work of Shuman is primarily con- 
cerned with regional health system planning models de- 
signed to demonstrate methods of increasing productivity 
and minimizing the cost of providing services, it is 
worthy of further examination for two reasons. First, 
it provides insight into the model-building aspects of 
operations research as applied to health system problems. 
Secondly, an extension of Wolfe's model is presented 
which provides an answer to the allocation question not 
given in the original study. 

The extension is made in the following way: with 

reference to (II. 3) the variables x^^ are taken to be 
the number of times personnel type i performs task com- 
plex j. The x„ are integers. The Y^, on the other hand, 
are now assumed to be continuous variables. Finally, an 
additional constraint set 




is added to the model, where d^ represents the number of 
times task complex j is demanded. Obviously, the task 
complex demands may be assigned to several personnel 
types, and the resulting model solution yields an indica- 
tion of personnel allocation. Shuman claims that by 
allowing to be continuous, a part-time staffing policy 
can be inferred from the model solution. This, in fact. 



33 



may be difficult to do because the optimal values of 
are as unlikely to terminate in half-time units as they 
are in full-time units. Thus, while the extended model 
is certainly an improvement over Wolfe's original model, 
it nevertheless is not completely representative of 
reality. Shuman provides a further modification to 
(II. 3) indicating how this model may be used for several 
units simultaneously. 

II. 4 Patient Classification in a Long-Term Care Setting 

Classification systems for long-term care have 
evolved essentially from those developed earlier for 
acute care. They have required some modification, mainly 
because of the need to specifically address the problems 
of aging and/or chronic disease. As will be seen, the 
studies have, in general, emphasized the following three 
areas : 

1. Degree of self-sufficiency in daily activities 

2. Mental and emotional state; 

3. Rehabilitative and instructional needs. 

Katz, et al, [36] 

The study of Katz, et a^. , with respect to the 
recovery process of aged patients, is notable for its 
prescient recognition of the importance of degrees of 
patient independence in activities of daily living (ADL) 
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as they relate to nursing service required. The ADL ' s 
identified as being most important are 

1. Bathing, 

2. Dressing, 

3. Toileting, 

4. Transferring, 

5. Continence, 

6. Feeding, 

with each further subdivided into a tri-level evaluation 
of independence. An eight category "Index of Indepen- 
dence in Activities of Daily Living" is offered, as shown 
in abbreviated form in Table II. 5. This instrument was 
tested on patients representing numerous primary diagnos- 
es. Katz found that almost universal applicability was 
possible regardless of diagnosis. Further, the phenome- 
non of increasing demand for nursing services as an 
ordered function of the index category was observed. The 
existence of factors of primary significance among the 
six in predicting degree of patient dependency was con- 
jectured, but little follow-up study is reported. 

McKnight [41] 

As part of a study conducted to determine the 
amount and type of care received by nursing home patients 
in the Denver area, McKnight developed a set of "Criteria 
for Patient Nursing Need" which was used to specify a 
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Table II. 5 

Katz, et_ aJL. Classifications of Independence in ADL 



Category 


Description 


A 


Independent in feeding, continence, trans- 
ferring, going to toilet, dressing, and 
bathing . 


B 


Independent in all but one of these functions. 


C 


Independent in all but bathing, and one 
additional function. 


D 


Independent in all but bathing, dressing, and 
one additional function. 


E 


Independent in all but bathing, dressing, go- 
ing to toilet, and one additional function. 


F 


Independent in all but bathing, dressing, go- 
ing to toilet, transferring, and one addi- 
tional function. 


G 


Dependent in all six functions. 


Other 


Dependent in at least two functions, but not 
classifiable as C, D, E, or F. 



three level 


(minimum care, moderate care, maximum care) 



patient classification system. These levels of classifi- 
cation are essentially analogous to the three levels of 
care Intermediate B, Intermediate A, and Skilled nursing 
care defined in Chapter I. The criteria identified were 



as follows: 




1 . 


Diagnostic description (primary and secondary) 


2. 


Nursing activities (medications, treatments. 
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special diets, etc.); 

3. Physiological factors (vision, hearing, con- 
tinence, mobility, etc.); 

4. Psycho-social factors (emotional and social 
behavior) ; 

5. Rehabilitative factors (functional ability, 
both mental and physical) . 

Classification was dependent upon the similarity of 
patient condition with several descriptive items speci- 
fied for each criterion and category. 

A major portion of this study was devoted to de- 
termining the amount of care received by patients of the 
various classification levels. Time study analysis was 
undertaken to determine the performance times for 25 
nursing care areas, both by patient classification and 
nursing skill level. In addition, activity sampling was 
performed in several of the nursing homes under study to 
give an indication of how nursing personnel time was allo- 
cated. It should be noted that the Denver work included 
the replication and extension of a study conducted by the 
City of Milwaukee Health Department [47]. 

Both the patient classification and nurse staffing 
aspects of McKnight's efforts will play a large role in 
our work. More will be said concerning this study in 
Chapter III. 
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Salmon, et al. [61,62] 

The RAPIDS system developed by Salmon, et a_l. has 
as its major objectives 

1. The forecasting of patient needs on input to 
the long-term care system, and 

2. Placement assessment, both initial and follow- 
up . 

A rating scale of one to five, one indicating no effort 

required and five indicating maximum effort required, was 

attached to each of the six areas denoted by the acronym 

R - Restorative procedures 
A - Activities of daily living 
P - Problem behavior 
I - Illness 

D - Dependency, general 
S - Social service 

In applying this system, the authors discuss its successes 
in meeting the second of their objectives, but similar 
outcomes were not obtained for the first goal. No attempt 
was reported concerning the relationship between patient 
classification and staffing levels. 

Burack [13], Burack and Denson [50] 

Noting the need for an interdisciplinary set of 
indicators of geriatric patient status, Burack proposed 
a five-item list of guidelines as follows: 

1. Medical classification (related to serious- 



ness of disease) ; 
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2. Functioning performance (independence in 
ADL ) ; 

3. Projected goals (restoration of activity 
levels) ; 

4. Therapeutic category (need for professional 
service, medications, or treatments); 

5. Services required (medical, nursing, psy- 
chiatric, rehabilitative, social work). 

Notice that the concept of goal-oriented care is empha- 
sized in this system, a feature that has typically been 
relegated to minor status in other classification systems 
Burack encourages further subdivision of the basic guide- 
lines by care providers to better reflect their own needs 
It is unfortunate that no indication of how this classi- 
fication may be translated into nursing action is given. 
Parker [52] 

In the first of two studies designed to elicit a 
mathematically-based geriatric patient classification sys 
tern, Parker examined the relationship between the 19 
health status indicators and 4 health-care need status 
categories shown in Table II. 6. Given that the binary 
vector S represents a patient profile in terms of the 
indicators and d^, i=l , 2 , 3 , 4 , the health care status, it 
was demonstrated that Bayes' Theorem could be used to 
determine the probability of a patient with profile S 



39 



Table II. 6 

Definitions of the Indicators and Categories 
Used by Parker 



Indicators (=1 if present, =0 if absent) 



1 . 


Single (=1) , Married 
( = 0) 


11. 


Any social disability 


2. 


Drugs prescribed (=1 
if at least one 
drug presceibed) 


12. 


Immobility--less than 
independent ambula- 
tion on level surface 


3. 


Cancer 


13 . 


Needing to be fed 


4 . 


Senility 


14 . 


Special treatments 
needed 


5. 


Vascular lesions of 
the central ner- 
vous system 


15. 


Major assistance dressing 
and bathing 


6. 


Diseases of heart 


16. 


Conscious but unable to 
communicate needs 


7. 


Diseases of skin 


17. 


Aberrant behavior 


8 . 


Arthritis 


18 . 


Mild confusion 


9. 


Decubiti, ulcers, 
fistulas 


19. 


Severe confusion 


10. 


Total or occasional 
incontinence 







Health-Care Need Status Categories 

1. Private home 

2. Personal care, home 

3. Skilled nursing home 

4. Chronic disease hospital 
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being in category d . from 



1 




i=l 



(II. 4) 



where 




S 1 0d i ) • • • • *P (S 19 S 18 0d i ) (III. 5) 



t h 

and indicates the presence of the l 



indicator. Using 



such a rule to classify 1,245 Medicare/Medicaid patients, 
Parker achieved a prediction rate of 70%. Methods were 
suggested for use in extracting a subset of the original 
variables which appeared most important in computing the 
categorization. These "key indicators" were found to be 
(in order of importance) 



1. Mobility, 

2. Continence, 

3. Major assistance dressing and bathing, 

4. Any special disability, 

5. Severe confusion, 

6. Special treatments needed. 



Reducing the set of indicators by using only the key 
indicators above, a prediction rate of 70% was also 
achieved. Further comments on this approach will be made 
in Chapter IV. 

Parker and Boyd [53] 



In the second of his two studies, Parker explored 
both the use of and relationships between a discriminant 
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analysis approach and a multi-hierarchical cluster analy- 
sis approach to patient classification. Taking as data 
a sample of patient profiles derived from Collaborative 
Patient Assessment Instrument (CPAI)* scores on some 60 
items, stepwise discriminant analysis was first applied 
in order to relate these items to one of six appropriate 
levels of care. An overall prediction rate of 77% was 
achieved. The clustering technique was then applied in 
order that patients with similar profiles might be 
grouped together. Five clusters were ultimately dis- 
covered, roughly analogous to the original six-level 
categorization. Taking cluster centroid vectors as being 
indicative of membership in that cluster, Parker proposed 
that a candidate's scores be added and compared to the 
totals for the five clusters as a rough classification 
rule . 

Key variables identified by each method are shown 
in Table II. 7. Those from stepwise discriminant analysis 
were determined by order of entry into the discriminant 
function, while cluster analysis variables were inferred 
from examination of how well each variable explained 
cluster membership (the "B/T" criterion) . 



* 



A more complete discussion of this instrument and its 
development is given in Chapter III. 
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Further elaboration on both the CPAI and the 
statistical techniques employed will be provided in sub- 
sequent chapters, as well as analysis of the results. 
Note, however, the importance socio-demographic informa- 
tion plays in both findings, as well as the fact that no 
easily applied classification scheme was tested for its 
predictive capabilities. Neither of the results were re- 
lated to staffing procedures. 

Other 

Two other classification studies are noted brief- 
ly. Ashton [3] developed a set of nursing criteria for 
evaluating extended care patients, the criteria being re- 
lated ultimately to a tri-level (self-care, intermediate 
care, intensive care) classification system. Major fac- 
tors identified were continence and mobility, mental and 
social status, independence in the ADL , and clinical nurs 
ing procedures required. 

Shaughnessy, et al. [65] proposed a classifica- 
tion based on seven major factors (mental status, communi 
cation, sensory deprivation, physiological disturbances, 
physical dependence, environmental and psycho-social 
items) in order to assist in developing a triage catego- 
rization of patients. The categorization was then relat- 
ed to general personnel allocation guidelines. 
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Table II. 7 

Key Variables Determined by Discriminant 
and Cluster Analysis by Parker and Boyd 

Discriminant Analysis Cluster Analysis 



1 . 


Patient's present 


location 


1 . 


Communication of needs 


2. 


Dressing 




2. 


Present location 


3. 


Length of time at 


location 


3. 


Research site 


4 . 


Behavior pattern 




4. 


Appropriate level of care 


5. 


Age 




5. 


Dressing 


6. 


Stair climbing 




6. 


Wheeling 


7. 


Total family income 


7. 


Toileting 


8 . 


Dentition 




8 . 


Mobility 


9. 


Birthplace 




9. 


Bathing 


10. 


Toileting 




10. 


Feeding 


11. 


Mobility level 




11. 


Bowel function 



(Abbreviated list) 



II. 5 Nurse Staffing in a Long-Term Care Setting 

With one notable exception, staffing studies re- 
lated to long-term care facilities have relied largely on 
the techniques of computer simulation to explore the 
questions of personnel mix, allocation, and assignment. 

We shall discuss three such simulation studies, and a 
mathematical programming approach. 



44 



For purposes of the discussions which follow, we 
note that computer simulation usually involves the model- 
ing of a complex system by means of interrelated mathe- 
matical functions. Such an abstraction attempts to cap- 
ture the essence of the manner in which various factors 
affect the system both directly and in combination with 
other variables. The model may be cast into either a 
descriptive or normative framework. Wagner [77] points 
out two basic caveats for such an approach: 

1. If uncertainties are inherent in the model 
(e.g., the simulation of demand for service 
based on some probability distribution) re- 
sults are subject to statistical error; 

2. In order to make the system model computa- 
tionally tractable, much abstraction and many 
simplifying assumptions may be required, with 
the undesirable result of divorcing the model 
from reality. 

It seems clear that the authors cited below were aware of 
such difficulties. 

Turner, et al. [74] 

Noting the need in a nursing home environment for 
the determination of approximate resource levels (nursing, 
equipment, non-medical support, and outside medical 
assitance) , and a reasonable scheduling system for such 
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resources as could be allocated. Turner, e^t ad. proposed 
two simulation models with these goals in mind. These 
models incorporated a list of 28 direct care activities 
and 22 supporting (or indirect care) activities, and 
showed that about 80% of such activities could be sched- 
uled, the remainder occurring more or less randomly. As 
stated by the authors, the models do not allow for para- 
metric examination of the task performance time estimates 
used. Proposed measures of effectiveness by which vari- 
ous configurations were compared were the following: 

1. Patient waiting time for service; 

2. Specific task/skill level assignment patterns; 

3. Amount of staff idle time; 

4. The attainment of minimum service requirements. 
McKnight and Steorts [43] 

The model proposed by McKnight and Steorts repre- 
sents a significant movement of simulation studies toward 
reality. Quite explicit descriptions of the nursing home 
environment in terms of number and size of units, patient 
mix (as per McKnight' s classification system cited earl- 
ier) , services offered, scheduling and occurence intervals 
of demands for service, staffing constraints, and task/ 
skill level preferences are considered. 



Beginning with the individual patient care plans 
translated into specific tasks and then aggregated within 
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classification levels, the model generates demands, both 
scheduled and random, with required service times taken 
from a normal curve fitted to task time-study data. To 
each demand is attached a priority, being a function of 
both patient classification and the approach of a cutoff 
time beyond which the demand would be unmet. Specific skill 
level assignments are made on the basis of a first and 
second choice preference structure. 

In addition to generating financial reporting in- 
formation, the model also reports the following: 

1) Percentage of patient demands met, both with 
and without waiting; 

2) Percentage of first and second choice person- 
nel assignments; 

3) Hours of care assigned by task/skill level; 
and 

4) Staff utilization information. 

Thus, from the manager's point of view, the relative 
effectiveness of various staffing configurations might be 
tested by making several model runs and comparing them 
on the basis of the above criteria. 

Hundert [31] 

In a very recent study, Hundert describes a two- 
phase nursing home simulation model whose dual objectives 
are the provision of the maximum amount of care at 
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minimum cost. The system takes as its input the follow- 
ing parameters: 

1) Schedule of services— exact times or time 
intervals for task performance; 

2) Server preferences — a first and second choice 
by task and patient classification; 

3) Task priorities — set initially by the user, 
then updated in much the same way as in the 
McKnight and Steorts model; 

4) Patient demand profile — a translation of the 
patient care plans in terms of the probabili- 
ty that a patient would require one of the 
specified services, with frequency and sched- 
uling information also provided. 

Through the use of time-study data, Hundert modeled ser- 
vice time distributions as gamma functions, which the 
simulation model can use to infer required task perform- 
ance times. 

Based on the output of this portion of the simu- 
lation, the cost phase of the model predicts average cost 
per patient day by means of linear regression estimates 
derived from historical data. Not surprisingly, it was 
found that nursing salaries, facility capacity and age, 
and patient census were the most important factors in the 



cost model. 
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The entire model was validated and used to deter- 
mine the optimal unit size, staffing mix and levels, 
and service schedules for a proposed 180-bed nursing home. 
Acceptable results are reported for the first portion of 
the model, but cost estimates were deemed to be in ques- 
tion . 

Liebman [38] and Liebman, et al, [39] 

The work of Liebman represents a significant de- 
parture from earlier nurse staffing studies in the long- 
term care field in its reliance on the use of integer 
programming allocation and assignment models. In an 
attempt to develop "team effectiveness profiles" for com- 
parison of various nursing team configurations in a 
hospital-based extended care unit (ECU) , a short-term 
(or daily) assignment model and a long-term allocation 
model were proposed. Taking i as the index of individual 
patients, j the index of skill levels, and k the index of 
nursing tasks, the short-term model is formulated as 
MAX 



s . t . 
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and the long-term model is 



MAX 



: * * ^ £ c -i 1^ 



j k 



j k j k 



s.t. £y j k = l, 

£ T, y - S . , 
v k D k 9 



y., e 0 



V k 



(II. 7) 



The variables x. in (II. 6) indicate the assignment 

1] K 

pattern of nursing skill levels to specific tasks and 

patients, where x„^=l implies the assignment is made. In 

(II. 7), the y., denote the fraction of demand for task k 
1 K 

allocated to skill level j. The c., are skill level-task 
preference measures solicited from professional nurses by 
means of a Q-sort psychometric measurement technique. 
Although interval scaled measures were thereby obtained, 
equally acceptable results using corresponding ordinal re- 
lationships were reported. The t^ and T^ are times re- 
quired to perform specific tasks on an individual patient 
level or aggregated over all patients, respectively. 

These were determined by asking nurses how long each pro- 
cedure usually took. Nursing time available to devote to 
direct care activities, S j , was determined by aggregating 
task performance times for each skill level, again accord- 
ing to current assignment procedures. The constants c* 
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and c** are normalizing factors. Both models require 
that all tasks demanded must be accomplished. A com- 
binatorial method relying on complete exhaustion of 
available time supplies was used to solve (II. 6) , while 
(II. 7) was easily formulated as a network flow problem. 

The validity of the model was tested by comparing 
the most effective assignment plans and team configura- 
tions computed with existing practices on the unit. Pro- 
fessional nurses were asked to state their preference 
between pairs of such results, with the model solutions 
usually being favored. 

While this study represents a significant effort, 
it does leave room for improvement. First, its concepts 
are definitely hospital-oriented, albeit for an ECU, and 
the findings may therefore pose some difficulties for im- 
plementation in the typical long-term care facility. Task 
performance times and the method of computing the tend 
to cause the solution to reflect heavily the status quo. 
Additionally, it is implicitly assumed that existing 
patient care plans may be readily formulated for use in 
the model in terms of the nursing task list presented. 
Finally, no provision is made for determining how con- 
strained nursing resources can affect the amount of direct 
care given, and how choices related to task priorities 
for various patient types might be made and evaluated. 
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Mehta [44] 

We briefly note a recent study by Mehta, which 
essentially quantified existing staffing and assignment 
procedures in sample facilities and categorized activi- 
ties as direct care, indirect care, travel, communication, 
and personal time. By work sampling, the time devoted to 
each task was obtained and an indication of the skill 
levels performing each task was given. The staff mix was 
determined by simply dividing the aggregate time spent by 
each skill level by seven hours to obtain full-time 
equivalent personnel. The use of a float staff was 
recommended to handle time for which no accounting could 
be made by the above procedure. Not surprisingly, the 
staffing recommendations that were provided closely re- 
semble the staff mix that existed on the units at the 
time of the study. 

II. 6 Work of the "Four University" Group 

We conclude our survey by noting the contribu- 
tions made in several areas of the long-term care field by 
the so-called "four university" group. Established in the 
late 1960s, the study team consisted of those researchers 
listed in Table II. 8 with primary research interests as 
indicated. Recognizing the many schemes for long-term 
care patient classification already in existence, the 



52 



Table II. 8 

The "Four University" Research Group 



Group Location 



Principal Investi- Purpose of Primary 
gator Interest 



Case-Western Dr. S. Katz* 
Reserve Dept. Dr. M. Stroud** 
Preven . Med . 



Studies of course of 
illness in chronic 
disease 



Harvard Center Dr. Paul Donsen 
for Community Mrs. Ellen Jones 
Health and 
Medical Care 



Evaluation of programs 
for care of the 
chronically ill 



Hospital Assn. Mr. L. Danehy 
of New York 
State 



Community resource 
allocation 



Johns Hopkins Dr. C. Flagle 
Sch. of 

Hygiene & Pub. 

Hlth. Opera- 
tions Research 
Division 



Facilitating decision 
making in management 
of patients 



* Now at Michigan State University 
** Now at University of Pennsylvania 



group saw its main task as the development of an assess- 
ment system that would not only capture the important 
aspects of then current systems, but also establish a 
"common language" of patient classification that would be 
interdisciplinary in scope. The result was the Collabor- 
ative Patient Assessment Instrument (CPAI) alluded to 
earlier, along with its accompanying user's manual [35]. 

A detailed discussion of the CPAI will be undertaken in 
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the next chapter. For the present, it is sufficient to 
call attention to the statement of Jones [34] , which 
points out the significance of the CPAI as a patient- 
oriented rather than process- or service-oriented system. 

Although the original group has officially dis- 
banded, the individual research teams have continued to 
pursue their interests in the LTC field in a collabora- 
tive manner. Among those studies currently underway are 
the improvement of the assessment tool as a means for im- 
proving the quality of patient assessment, and additional 
work on the relationships between patient classification, 
demands for care, and facility nurse staffing. 

I I. 7 Summary 

This chapter has attempted to describe briefly 
some relevant studies concerning patient classification 
and nurse staffing in both the acute and long-term care 
environments. Commonalities in conceptual approaches, 
significant improvements, and areas for further explora- 
tion have also been highlighted. It is against this 
background that the models and procedures proposed in this 
dissertation are presented. It is hoped that the his- 
torical perspective that has been provided will prove 
helpful in evaluating the new methodologies. 



Chapter III. Development of the Methodology 



III.l Introduction 

Having discussed the history and current state of 
the art of nurse staffing methodologies we turn now to 
the development of the proposed models. Recall that we 
essentially are seeking a process that will assist a LTC 
facility administrator in solving personnel budgeting 
problems by presenting alternative courses of action for 
consideration. Unlike some previously designed systems 
of this sort, however, the approach suggested here relies 
heavily on the concept of patient needs as related to de- 
mand for nursing service. As a result, supplementary in- 
formation is concurrently produced that will benefit the 
director of nursing in allocating the predicted nursing 
resources . 

After an initial discussion of the general prob- 
lem to be solved, we will demonstrate how patient needs 
may be related to nursing activity through classification. 
The insights obtained will lead directly to the construc- 
tion of a mathematical programming model and two modified 
formulations derived from it. These models will serve as 
the basis for solving the budgeting and nurse staffing 
problems . 
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III. 2 The Administrator's Problem 

As was pointed out in Chapter I, the LTC facility 
administrator, as the primary managerial decision-maker, 
is responsible for the total allocation of available re- 
sources so that the essentials of institutional life may 
be provided. Given that the bulk of operating expenses 
are attributable to personnel costs, especially those of 
nursing staff, we chose to concentrate primarily on the 
manner in which such expenses could be related to the 
provision of service. Recall, too, that the five con- 
siderations identified by Aydelotte [5] (quality of care, 
patient characteristics, staffing predictions, staffing 
logistics, and quality measurement) were postulated as 
being of consequence in any budgeting process.* Addi- 
tionally, certain exogenous variables must be considered 
by the administrator in reaching an acceptable decision 
as to personnel budget and the staffing mix it would imply. 
These variables include the following: 

1. State and federal requirements for minimum 
staffing and skill level coverage (e.g., R.N. 
on duty at least one shift per day, adherence 
to a staf f-to-patient ratio of 1:25, etc.); 

2. Reimbursement rates for Medicare/Medicaid 
patients ; 

3. Demand intensity implied by the facility's 



*These items were discussed in Chapter I. 
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patient mix. 

The synthesis of all of the above-mentioned fac- 
tors into a normative, decision-making framework, along 
with such intangibles as the desire to maintain a good 
professional reputation, is part of what Baloff, Aber- 
nathy, and Hershey [8] identify as the "aggregate budget- 
ing process." The decision variables involved in this 
process are the budget to be allocated to nurse staffing 
and the staffing levels and skill mix to be maintained. 
Essentially, then, the administrator's problem may be said 
to be 

How to fix budget and staffing levels in a manner 
responsive to patient needs and other factors, 
both for the short run and as a result of future 
alterations in the exogenous and/or endogenous 
considerations mentioned earlier (e.g., legal re- 
quirements, reimbursement rates, patient mix, 
etc . ) . 

Baloff, et al. [8, p.4] succinctly capture the essence of 

why this problem is non-trivial: 

. . .The difficulties of nurse staffing planning are 
partially caused by two insurmountable problems 
for which no complete remedies exist: (1) the 

impossibility of developing perfect demand fore- 
casts for the future, and (2) the inability of 
specifying exactly and unequivocally the manner 
in which demand should be translated into required 
staff . 
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To these points we might also add the inherent pitfalls 
in the intermediate step of relating patient health status 
to demand . 

Recognizing the major areas of difficulty in nurse 
staffing planning, as noted above, it is perhaps wise to 
view the administrator's problem in terms of the resolu- 
tion of these problems. Certain of the exogenous changes 
to which a response is desired may be captured in the 
model which translates demand into required staff. The 
relationship between patient health status and demand, 
along with the development of future demand forecasts, 
may be viewed as a separate but interrelated entity. 

While we do not propose to predict future patient status 
in a manner analogous to Singer [68], we may rely on the 
relative stability of the patient mix within LTC facili- 
ties to lessen the need for such a procedure. Then, too, 
a method of assessing patient health status which may be 
readily applied at frequent intervals decreases the neces- 
sity for long-range demand forecasting, since it is 
usually the case that such predictions are required in the 
absence of better information. With this overall solution 
framework in mind, we turn now to the development of a 
demand-related patient classification system. 
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III. 3 A Patient Classification System 

The lack of a truly useful patient classifica- 
tion methodology for our purposes in the long-term care set- 
ting has been previously noted. Such systems as do exist 
fall short of the goal of relating patient health status 
to demand for nursing service for essentially two reasons: 

(1) health status is, in fact, not adequately reflected, 
and (2) demand predictions are overly simplified and not 
readily applicable to the aggregate budgeting process. By 
way of example of the first shortcoming, we cite the use 
of patient "level of care" placement. Since the place- 
ment decision must, of necessity, include the considera- 
tion of several other factors in addition to patient 
health status, these factors may at times be so important 
as to alter any decision based solely on health status. 

Then, too, the time lags involved in the reassignment of 
a patient whose health status has changed can cause the 
current placement to be a poor predictor of nursing needs. 
The second shortcoming is exemplified by the PETO system, 
where the predicted demands for nursing services are 
stated in terms which do not allow for determination of 
the skill level mix as it relates to budgeting. It should 
be noted that other methods, such as those proposed by 
Burack [13,50] and Katz [36] were never shown to be re- 
lated to demand for nursing service. 
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Perhaps the ideal vehicle for reaching our stated 
goal is the intermediate step of a patient care plan. 
Having evolved from the traditional nursing care plan by 
including patient data of interdisciplinary interest, 
the patient care plan in its most recent form is usually 
presented in a "problem-oriented" format, and relies on 
a thorough consideration of patient status. Such a 
structure requires recording of both subjective and ob- 
jective problem identification and assessment, a plan of 
approach, and the specific actions required to ameliorate 
the difficulties. Often included are expected goals to 
be attained as a result of treatment. The implementation 
of this approach at the nursing level is shown sche- 
matically by Bower [10, p.10] in Figure III.l. 

Figure III.l 

Bower's Process of Planning Nursing Care 
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It is of note that this form of patient care plan in- 
corporates a one-to-one correspondence between patient 
status and nursing demands. Aggregated over all patients 
in a facility, these demands would provide a reasonable 
indication of required staffing. There are, however, 
three drawbacks to this approach: (1) skill level prefer- 

ences for task assignments are often omitted, (2) exami- 
nation of alternative budget levels and skill level mixes 
would be difficult, and (3) the complex relationship 
between the myriad of possible nursing actions and per- 
formance times would be difficult to identify and stan- 
dardize. On a more mundane level, the problem-oriented 
patient care plan has yet to be fully implemented in a 
significant number of long-term care facilities. Indirect 
use of care plans through translation of indicated activi- 
ties into tasks listed on a predetermined schedule has 
been suggested by Liebman [38,39] and in the several simu- 
lated models cited in Chapter II. Such procedures, how- 
ever, can be extremely time-consuming, and can require a 
great deal of professional nursing judgment to implement. 

Given the state of the art in patient classifi- 
cation systems and the inherent difficulties in the use 
of patient care plans for budgeting purposes, there is a 
need for a classification system emphasizing ease of 
application and comprehensive assessment and demand 
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prediction. That is, the categorization of patients into 
distinct, homogeneous groups should take place with the 
following criteria in mind: 

1. The classification decision should have its 
roots in a comprehensive assessment of a 
patient with respect to functioning status, 
impairments, medical status, psycho-social 
status, and medically defined conditions; 

2. The classification groupings must bear some 
well-defined relationship to demand for nurs- 
ing services. 

In keeping with the first of these guidelines, the 
proposed classification system is to be based on the 
assessment information contained in the Collaborative 
Patient Assessment Instrument (CPAI) developed by the 
"four university" group.* A listing of the classifica- 
tion descriptors is given in Table III.l, and an illustra- 
tion of the instrument itself is shown in Appendix A. As 
can be seen from the descriptors included, the overall 
theme of the assessment is problem-oriented instead of 
diagnostically dependent. Jones [35] further identifies 
the following four characteristics of the descriptors: 



* The organization of this group was discussed in 
Chapter II. 
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Table III.l 



CPAI Classification Descriptors 



Identifying and sociodemographic items : 
Date of classification 
Interviewer or Classifier 
Program or project identification 
Patient’s record or study number 
Social security number or health 
insurance benefit number 
Birth date 
Birthplace 
Sex 
Race 

Marital status 
Religious preference 
Residence address 
Patient’s location at time of 
assessment 

Length of time at location 
Usual living arrangements 
Number of living children 
Education 
Usual occupation 
Employment status 
Total family income 
Health care coverage 

Functioning status items : 

Mobility level 

Transferring 

Walking 

Wheeling 

Stair climbing 

Bathing 

Dressing 

Ea t ing /feed ing 

Toileting 

Bowel function 

Bladder function 

Communication of needs 

Orientation as to time, place, person 

Behavior pattern 



Imp a irmen t it ems : 

Sight impairment 
Hearing impairment 
Speech impairment 
Fractures and disloca- 
tions 

Joint motion disorders 
Joint pain and swelling 
Missing limbs 
Paralysis /pares is 
Dentition 

Medical status: risk factor 
measurements : 

Height 

Weight 

Blood pressure 
Blood cholesterol 
Kidney function measure- 
ment: BUN 
Albuminuria 
Cigarette smoking 

Certain medically defined 
conditions : * 

Alcoholism 

Anemia 

Angina and/or myocardial 
infarction 
Arthritis 

Cardiac arrhythmias 
Congestive heart failure 
Diabetes mellitus 
Drug abuse 
Hypertension 
Malignancy 
Mental illness 
Neurologic disorders 
Respiratory disease, 
chronic 



* In addition, all medical diagnoses may be listed. 
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1. Patient orientation: the individual is 

described as he or she actually i£, not in 
terms of services being rendered or physical 
location ; 

2. Multidimensionality: The scope of the assess- 

ment is a broad one; 

3. Objectivity: little subjective or interpre- 

tive analysis is required, hence results are 
reproducible ; 

4. Relevancy: the items included have been shown 

to be strongly related to the major issues of 
morbidity and mortality. 

In addition, the subcategorizations provided with the 
functioning status, impairment, and medically defined 
items serve to indicate the patient's degree of dependency 
with respect to each descriptor. The incorporation of 
the suggested coding provided in the CPAI User's Manual 
[35] thus yields an ordinally scaled set of variables for 
purposes of analysis. 

As part of their developmental work on the CPAI, 
three of the research groups undertook the field testing 
of the instrument in various locations. As a result, a 
total of 623 assessments on Medicare/Medicaid patients 
was obtained, the sample consisting of those patients 
whose placement levels were as shown in Table III. 2. As 
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Table III. 2 

CPAI Sample Categorized by Level of Care and Research Group 



Level of Care 


Research Group 

HANYS Johns Hopkins Harvard 


Total 


Private Residence 




47 


... 


47 


Rented Room(s) 


2 








2 


Domiciliary /Personal 
Care Facility 




18 




18 


ECF/Supportive Nursing 


30 


70 


... 


100 


Nursing Home 


63 


53 


. 


116 


ECF/ECU 


31 


3 


_ 


34 


Chronic Disease/ 

Rehabilitation Hospital 




11 


295 


306 


Total 


126 


202 


295 


623 



an additional and highly important part of this study, 
criteria for judging a patient's most appropriate level 
of care were established and utilized by various health 
professionals to obtain an indication of the divergence 
between legal placement and placement based on judgment 
of patient needs. The resulting categorization, which 
illustrates the observed divergence, is shown in Table 
III . 3 . 

The appropriate placement decision was made, for 
the most part, on the basis of the patient's health and 
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Table III. 3 

CPAI Sample Categorized by 
Appropriate Level of Care and Research Group 



Appropriate Level of Care 


HANYS Johns Hopkins Harvard 


— , 

Total 


Private Residence/Room/ 
Home with or without 
Home Care 


1 


59 


4 


64 


Personal Care Facility 




15 


5 


19 


ICF/Supportive Nursing 
Care Facility 


34 


77 


54 


165 


Skilled Nursing Home 
(ECF/ECU) 


91 


45 


68 


205 


Chronic Disease/ 
Rehabilitation Hospital 




6 


157 


163 


Other Hospital 


,,,,,, 





7 


7 


Total 


126 


202 


295 


623 



mental status. Where there were some overwhelmingly im- 
portant factors among the patient's sociodemographic 
characteristics (e.g. relatives living at home, insurance 
arrangements, etc.), the placement decision may have been 
altered. Similarly, borderline placement questions were 
usually resolved by recourse to this information. By and 
large, however, the adjudged appropriate level of care 
provides the high degree of association with patient 
status (obtained through assessment) that we seek in 



66 



keeping with the first criterion for classification. 

Quantification of the relationship between this 
judgmental variable and the multivariate indication of 
health status provided by the CPAI can therefore become 
the basis for an operable patient classification system. 
Due to the nature of the data involved, and in keeping 
with the need for a readily-applied methodology, a re- 
gression-based multivariate statistical approach has been 
utilized. Further explanation of specific methods, 
rationale for the approach, detailed description of the 
application of the method of choice, and the resulting 
classification system are provided in Chapter IV. 

There remains the consideration of the link 
between the derived classification system and nursing 
service demands. When others have attempted to quantify 
these demands it has typically been the case that the 
time necessary to perform the several tasks identified 
with the functions of nursing are presented. These per- 
formance times are usually, although not necessarily, 
obtained from time-study experiments, some typical ex- 
amples being given in Liebman [38,39], McKnight and 
Steorts [43], and Hundert [31]. Unfortunately, the task 
listings usually contain anywhere from 65 to 75 items 
which, when combined with the various patient classifi- 
cations and nursing skill levels in a staffing model, can 
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lead to models too large for most computers to solve. 

In a nursing home research study conducted in 
Denver, Colorado, however, McKnight [41] proposed the 
grouping of tasks typically performed in long-term care 
facilities into 25 "care areas" as shown in Table III. 4. 
Notice that in addition to tasks which have traditionally 
been identified as "direct care" items (i.e., performed 
in the presence of the patient) , there is also involved 
an area identified as "associated nursing care functions." 
The constituent tasks of this grouping, such as charting 
and rounds, are certainly more patient-oriented than 
other "indirect care" tasks performed by nurses. For this 
reason McKnight [42] identifies the 25 care areas as 
"patient centered" instead of as direct care groupings. 

A major portion of the Colorado work was devoted 
to time studies of nursing activities in 14 homes general- 
ly judged as giving "good" quality care. Specifically, 
the amount of time devoted by nursing personnel to the 
constituent tasks of the 25 care areas over a 3 1/2-day 
period for 195 sample patients was obtained. The findings 
shown in Table III. 5 indicate the average time, per 
occurrence , for each care area by patient classification 
(minimum care, moderate care, maximum care). Although 
this is a common way of presenting this type of result, 
such data are difficult to apply without prior knowledge 
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Table III. 4 

Nursing Care Areas Identified by McKnight 



01. ELIMINATION 

1. Toilet 

2. Commode 

3. Bedpan 

4. Urinal 

02. CATHETERIZATION (and related procedures) 

1. Catheterization 

2. Insertion of Foley catheter 

3. Bladoer irrigation 

4. Emptying drainage bottle 

03. DAILY CARE 

1* Morning care 

2. Evening care 

3. Rest period 

4. Night check 

04. BATHING 

1. Complete bed bath 

2. Part l al bed bath 

3. Partial bath 

4. Shower 

5. Complete tub &ath 

05. BED MAKING 

1. Occupied 

2. Unoccupied 

3. Straightening beds 

06. CARE OF SKIN 

1. Preventive skin care 

2. Treatment of decubitus 

3. GE NT LE MASSAGE 

07. CARE OF HAIR AND FACE 

1. Combing and braiding 

2. Shampooing 

3. Cutting 

4. Setting hair 

5. Scalp treatment 

6. Shaving 

08. ORAL HYGIENE 

1 . Brush i ng teeth 

2. Denture care 

3. Mouth care 

4. Care of lips 

09. CARE OF NAILS 

1 . Fl NGERNAl LS 

2. Toenails 

3. Manicure or pedicure 

10. REHABILITATION and RECREATIONAL 

1. Passive exercise 

2. Supervised exercise 

3. Diversional 

11. FOOD anO NOURISHMENT 

1 . Breakfast 

2. Lunch 

3. Dinner 

4. Extra feedings 

5. Water 



12. MEDICATIONS 

1 . Oral 

2. Intramuscular 

3. Subcutaneous 

4. I 4-ST I ELAT ION. OF. DROPS 

5. Intravenous 

6. Rectal 

7. Inunction 



13. CARDINAL SYMPTOMS 

1. Mouth temperature 

2. Rectal temperature 

3. Radial pulse 

4. Blood pressure 



14. 



15. 



THERAPEUTIC MEASURES 



1. Hot water 


bottle 


2. Ice bag 


6 . 


Ul tra Sound 


3. Compress 


7. 


Diathermy 


4. Hot soak 

5. Heat lamp 


8 . 


Whirlpool 




9. 


Bandages 


URINE TESTS 






1. Clinitest 






2. Acetone test 




3. Testape or 


URl ST 1 X 



16. ENEMA 

1. Cleansing 

2. Retention 

3. Removal of fecal impaction 

4. Insertion of colon tube 



17. CARE OF COLOSTOMY or I LEL0ST0MY 

1. Change of dressing 

2. Irrigate in bed 

3. Irrigate In toilet room 



IB. ASSOCIATED NURSING CARE FUNCTION 

1. Telephone contact or request 

2. Personal contact 

3. Assistance with procedures 

4. Charting 

19. WEIGHING PATIENT 

1 . At be DS I DE 

2. I N WE I GHl ng ROOM 

20. GAVAGING PATIENT 



21. FEMALE PROCEDURES 

1. Vaginal douche 

2. Perineal care 

22. OXYGEN THERAPY 

1 . Cannula 

2. Catheter 

3. Mask 

4. Tent 

23. CARE OF CRITICALLY ILL 

1 . Acute conoi t ion 

2. Comatose condition 

3. Dying conoition 

24. PREPARING BODY AFTER DEATH 



25 



MISCELLANEOUS 



Table III. 5 

Mean Time* Needed to Complete Each of the Nursing Care Areas by Personnel 
Level and by Patient Ambulatory Status and Nursing Need as Found by McKnight 

OVERALL PERSONNEL LEVEL 1! AMBULATORY STATUS|| NURSING NEED 
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of the existing individual patient care plan; that is, 
an indication of the potential number of occurrences of 
a care area for each patient. Since our aim here is to 
relate demand to homogeneous patient categories, per- 
formance times for a "typical" or "average" patient of 
each group would be the more appropriate measures. In 
order to derive such information, the source data for the 
Colorado study were obtained, and aggregate times by care 
area for all patients found. These times include an 
allowance for the procedural steps of each activity with- 
in a care area that were not performed but were deter- 
mined by professional judgment to be necessary. The use 
of time information in this form is appropriate to the 
normative models to be proposed. At this point, the 
aggregates were divided by the number of patients in each 
category, and then divided again by 3 1/2 days. The 
resulting information is thus interpreted as the average 
required performance times to be devoted to each care 
area for an "average" patient of each category per 24-hour 
day. Note that we are implicitly assuming that the pat- 
tern of demands as demonstrated in the sample data is a 
reasonable representation of the pattern that would exist 
in populations of similarly classified patients. Com- 
pletely random selection of the patient sample for the 
original study lends credence to this assumption. The 
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average required care area performance times by patient 
classification are given in Table III. 6. As a matter of 
interest, the average total nursing time per patient day 
obtained by summing the care area performance times is 
.61 hours for minimum care patients, 1.55 hours for moderate 
care patients, and 2.77 hours for maximum care patients. 

Recall that in Chapter II we noted that McKnight's 
tri-level classification system may be interpreted as 
being analogous to the three levels of care denoted as 
Intermediate B, Intermediate A, and Skilled Nursing care. 

For the appropriate level of care used in our classifica- 
tion system the analogous levels as shown in Figure 
III. 3 are Personal Care Facility, ICF/Supportive Nursing 
Facility and Skilled Nursing Home (ECF/ECU) . The result 
we seek in keeping with the second of our classification 
system criteria is thus immediate. The implicit demand 
for patient-centered nursing services based on patient 
health status can be inferred by first classifying the 
patient into one of the three categories (Int. A, Int. B, 
Skilled) , and then utilizing the average required per- 
formance time information available from Table III. 6 for 
the corresponding classification. Since this time/demand 
data is for an "average" patient, the result is most 
powerful when used to determine demands by care area and 
patient classification for the residents of a large unit 



Average Required Performance Times Per Day (in Minutes) 
by Care Area and Patient Classification 
(Based on McKnight f s Colorado Study Data) 





Care Area 


Patient 

Minimum 

(N=64) 


Classification 

Moderate 

(N=95) 


Maximum 

(N=36) 


1 . 


Elimination 


.53 


4.94 


8.24 


2. 


Catheterization 


.02 


1.00 


5.84 


3. 


Daily Care 


2.36 


23.23 


43.79 


4. 


Bathing 


2.59 


8.09 


13.84 


5. 


Bed Making 


3.46 


6.11 


6.41 


6. 


Care of Skin 


.05 


1.79 


11.10 


7. 


Care of Hair and Face 


.75 


2.89 


3.02 


8. 


Oral Hygiene 


.02 


.58 


1.20 


9. 


Care of Nails 


.13 


.35 


.35 


10. 


Rehabilitation and 
Recreation 


1.31 


4.61 


6.68 


11. 


Food and Nourishment 


8.56 


17.35 


36.32 


12. 


Medications 


5.99 


9.27 


10.45 


13. 


Cardinal Symptoms 


.61 


.90 


2.90 


14. 


Therapeutic Measures 


.04 


.44 


1.92 


15. 


Urine Tests 


.17 


.49 


.18 


16. 


Enema 


.12 


1.00 


1.00 


17. 


Care of Colostomy or 
Ileostomy 


* 


.21 


* 


18. 


Associated Nursing 
Care Function 


4.13 


6.17 


6.21 


19. 


Weighing Patient 


.04 


.04 


* 


20. 


Gavaging Patient 


* 


. * . 


* 


21. 


Female Procedures 


* 


.02 


* 


22. 


Oxygen Therapy 


.12 


.09 


.16 


23. 


Care of Critically 111 


.20 


.11 


is 


24. 


Preparing Body After 
Death 


* 


* 


is 


25. 


Miscellaneous 


5.05 


5.50 


6.57 



* Not observed. 
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or an entire facility. The rationale is that the aver- 
aging process used in the derivation of required per- 
formance times would be replicated if a large enough 
patient population were considered at the facility level. 

The need for a demand -related classification sys- 
tem has thus been met. Note that since we are consider- 
ing a facility oriented problem the "home care" classifi- 
cation is not considered. On the other hand, neither do 
we consider the "chronic care" category, mainly because 
of the lack of supporting time data. Nevertheless, the 
proposed system is well suited for the majority of long- 
term care facilities and is in consonence with the trend 
toward multi-level certification. In the next section 
we shall demonstrate how the demands derived by this 
method may be allocated among available nursing person- 
nel . 

III. 4 A Basic Staffing Model 

Given that we now have a means whereby patient 
needs may be translated into demand for patient-centered 
nursing activities through the intermediate steps of 
classification and aggregation, we now examine how this 
demand is related to staffing. This issue is the heart 
of the facility administrator's problem. Recall that the 
basic decision variables under the administrator's 
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control are the budget to be allocated to nursing staff 
and the number and skill level mix of nurses to be main- 
tained. An additional decision to be reached is how the 
available nursing resources may be assigned to various 
nursing functions. This latter concern is usually with- 
in the purview of the Director of Nursing. 

Decisions must be made, however, in an environ- 
ment constrained by several factors. Alluded to earlier, 
these include the following: 

1. An upper limit on funds which may be devoted 
to staffing; 

2. Legal staffing minimums and guidelines; 

3. Satisfaction of patient needs through nursing 
intervention; 

4. Limitations on the amount of nursing time 
available to devote to patient care due to 
other staff responsibilities; 

5. Assurance that patients with high priority 
nursing needs are not neglected. 

With all such restrictions in mind, decision variables 
may be fixed at those levels which act to optimize what- 
ever criterion is considered most important. If dollar 
cost of nurse staffing is chosen, for example, it is sub- 
mitted that a minimum cost solution would not be diffi- 
cult to find by using legal constraints as a basic 
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quideline and heuristic procedures thereafter to meet as 
many other restrictions as desired. We propose, how- 
ever, a normative approach using as a criterion the 
appropriateness of assignment of nursing personnel as a 
surrogate measure of quality of care. 

The techniques of mathematical programming are 
well-suited to model building of this type; that is, the 
optimization of a criterion function subject to certain 
constraints, both of which may be expressed mathematical- 
ly. Taking personnel budget as a parameter and staff 
numbers, skill level mix, and assignments as variables, 
the postulated model is cast into the mixed-integer lin- 
ear programming form 
MAX cx 
s . t . Ax = b 

Xj > 0 and integer, V j €. 

Xj > 0, Vje N 2 



(III.l) 
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Specifically, the Basic Staffing 


Model ( BSM) 


is given as 


MAX c . . , x . . + << p . 

-■f- i]k ink *.f- 

ljk J J jk J 


k£jk 


(III. 2) 


s.t. <Ts^n^ Budget 




(III. 3) 


<< x. .. -a.n.^0, 

ink li— 

D k J 


Vi 


(III .4) 


2 x. - fl .. =0, 

i il k P D k 


V j ,k 


(III. 5) 


L.^n.^U. and integer 

l—i—i 


, Vi 


(III. 6) 


x ijk^ °' 


V i, j ,k 


(III. 7) 


L jk— Pjk— ^jk' 


V j ,k 


(III. 8) 



where 



i=l,2,...,I indexes the various nursing personnel 
skill levels, j=l,2,...,J indexes the care areas, and 
k=l,2,...,K indexes the levels of patient classification. 
Further, the following quantities are defined: 
c. = a constant measuring the appropriateness per unit 

1] K 

time of having skill level i assigned to care area 
j for patient class k. c.^ k >0 



P jk 



= a dichotomous (0,1) constant, with a value of 1 
denoting the effectiveness per unit time of a care 
area/patient classification combination deemed to 
have a high relative priority, and a value of 0 
denoting a low priority combination. 
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x 



ijk 



s 



i 



n 



a . 
1 



Pjk 




L 



jk 




a variable which indicates the amount of 
time devoted by skill level i to care area 
j and patient class k in a model solution, 
a constant representing the dollar cost per 
unit time for skill level i. 
a variable delineating the number of mem- 
bers of skill level i to be included in the 
staff mix. 

a constant indicating the number of time 
units available for assignment to patient 
centered activities for skill level i (i.e., 
total available time less time needed for 
other nursing and personal activities) . 
a variable specifying the amount of time 
allocated by the model solution to care area 
j for patient class k. 

upper and lower restriction constants on 
the number of personnel of skill level i 
which may be included in the model, 
upper and lower restriction constants on the 
amount of time to be allocated to care area 
j for patient class k. 



With this information available, we examine the constitu- 
ent parts of the Basic Staffing Model to illustrate how 
each relates to the administrator's problem. 
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The objective function (III. 2) indicates the 
maximization of the sum of both a time-related appropri- 
ateness measure over all possible skill level/care area/ 
classification combinations and a time-related priority 
measure. The meld of skill levels with jobs produced 
serves as a surrogate measure of quality of care in keep- 
ing with the findings of Revans [57,58]. More will be 
said on the priority aspect below. In addition, Liebman 
[38,39] demonstrated that an objective function given in 
terms of performance times produced superior assignment 
patterns to objective functions without such a feature 
when used in models of this type. 

Constraint (III. 3) simply assures that the upper 
limit on personnel budget will not be exceeded. In order 
to insure against assigning more time than is available 
for any skill level, the constraints (III. 4) restrict 
all possible assignments of skill level i to be less than 
or equal to the time available for patient-centered 
activities . 

The constraint sets (III. 5) and (III. 8) when con- 
sidered together serve a dual purpose. The first of these 
is to assure that when the assigned times of all skill 
levels to care areas and patient classifications are 
added, patient demands for all combinations are satisfied. 
Note, however, that the demands themselves are model 
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variables, albeit with certain minimum times which must 
be met and maximum times that cannot be exceeded. The 
maxima and minima for every care area/classification/ 
demand combination serve as a starting point for the con- 
sideration of priority of patient needs. 

That some activities are given priority over 
others in the resource constrained environment of a long- 
term care facility is a well-known fact. Little and 
Carnevali [40, p. 53] suggest that attention should be 
given to,"... those problems which are most relevant to 
the patient's well being, keeping in mind the realities 
of staffing levels and the abilities of those who con- 
tribute to patient care." They go on to suggest the 
existence of a hierarchy of patient needs within which 
physiological problems would be considered first, fol- 
lowed by safety, love, esteem, and self-actualization. 
Bower [10], on the other hand, postulates a hierarchy of 
needs wherein problems threatening a patient's life, 
dignity, or integrity are considered first; with problems 
which may destructively change the individual considered 
next; and those affecting the developmental growth of the 
individual following. It is emphasized, however, that 
priority setting should not negate the importance of 
those demands considered to be of lower priority, but 
rather to provide a means by which demands may be 
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satisfied efficiently. 

With these ideas in mind, we can incorporate a 
rudimentary priority system into the Basic Staffing 
Model in the following manner. Assume that we can 
dichotomize the set of care area/patient classification 
combinations into a set containing those combinations of 
relatively high priority and a set with combinations of 
lower priority. Guided by the average performance times 
as shown in Table III. 5, the upper and lower bounds on 
are set in a manner indicative of the priority class 
for (j,k). That is, relatively high priority combina- 
tions might have their lower bounds set at or above the 
average time, and the upper bound set at some greater 
value. Lower priority items, on the other hand, might 
have their lower bounds set below the average time, and 
the upper bound at or near the average. Such a framework 
conceptually insures that, in a situation of constrained 
resources, high priority combinations will receive an 
adequate allocation of nursing time, while lower priority 
combinations will be allocated at least their lower bound. 

Given the dichotomy of care area/patient classi- 
fication combinations and the loosely defined priority 
system it implies, a modification to the objective func- 
tion as shown in (III. 2) is possible. That is, we would 
like to insure that should additional nursing resources 
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be available after satisfaction of the lower bounds, 
they would be assigned to the higher priority combina- 
tions. Thus, if we set p^ = 1 for the high priority 
combinations, the objective function would include a 
measure reflecting the importance of assigning addition- 
al time as is available to high priority combinations. 

In our rudimentary system, note that no further pri- 
ority ranking is given. Such a determination might be 
the subject of future research. The entire process 
described above is obviously highly subjective, as is 
the entire concept of priority setting. It is, however, 
believed to be a reasonable way to handle a difficult 
problem using the power of the mathematical model. 

Finally, constraint set (III. 6) allows the 
specification of staffing level restrictions, again by 
using upper and lower bounds. Minimum coverage by skill 
level may be obtained by using the appropriate lower 
bounds. Staf f-to-patient ratios may also be incorporated 
here. The upper bounds can be set to reflect the exi- 
gencies of the hiring prospects for a skill level, or 
simply to restrict the allowable number of nurses of each 
skill level the model may assign. Note that the varia- 
bles n^ have been restricted to be integers. By comput- 
ing the availabilities a^ on the basis of an eight-hour 
shift, the integrality restrictions imply the assumption 
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of no part-time personnel. Although this is the 
manner in which we propose to use the model, an exten- 
sion to a part-time personnel policy is feasible. For 
example, if the a^ were computed for a four-hour shift, 
the number of half-time personnel would be obtained, it 
then being an easy step to determine the full- and part- 
time staffing indicated. 

An additional and very important aspect of the 
Basic Staffing Model is its adaptability for use in a 
variety of circumstances. Parameters may be set in such 
a way that staffing and assignment patterns for any given 
unit and/or shift may be calculated, assuming the appro- 
priate information is available. Its greatest potential, 
however, is realized in aiding the solution of the aggre- 
gate budgeting problem. In fact, due to the nature of 
the available performance time data, it is best applied 
using a 24-hour day as the basic time frame. Taking the 
parameters a^ to reflect per shift availability, total 
daily staffing requirements and gross assignments of 
skill levels to care areas and patient classes can be 
derived. Information presented in this fashion allows 
for the flexible scheduling of the daily routine accord- 
ing to shift and individual personnel by the director of 
nursing in any way deemed appropriate for a particular 
facility. 
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III. 5 Extensions to the Basic Staffing Model 

Assuming that the Basic-Staffing Model is an 
accurate normative form of the administrator's problem, 
an optimal solution will yield sufficient information to 
enable specification of the best feasible staffing mix; 
that is, subject to the assumed constraints on budget, 
patient mix, and so on. Although this optimal solution 
is important of itself, however, the model from which it 
was obtained allows of no information concerning alterna- 
tive courses of action. That such information is of con- 
sequence to the administrator should be quite clear. 

The potential impact on staffing of the internally and 
externally controlled factors alluded to in our discus- 
sion of the administrator's problem must be weighed in 
reaching an acceptable solution. Then, too, the effects 
of alterations in data obtained through subjective evalu- 
ation (e.g., priority estimates) on the solution can be 
of interest. 

Specifically, we propose to demonstrate a metho- 
dology that will provide insights into the Basic Staff- 
ing Model for the following four areas of concern: 

1. The effect of altering the total number of 
hours of patient-centered activities pro- 
vided (the "service level") on the objective 
function, staff mix, and required budget; 



84 



2. The impact of changes in the budget level 
on the objective function, staff mix, and 
service level; 

3. The effect on the objective function, staff 
mix, and required budget of alterations in 
the priority-related demand estimates; 

4. The impact on the objective function, staff 
mix, and required budget of changes in regu- 
lations concerning minimum staffing by skill 
level and/or staf f-to-patient ratios. 

Note that we may examine three subsidiary prob- 
lems by means of the methods developed in connection with 
the first area. First, problems which are suboptimal 
with respect to the Basic Staffing Model optimal solu- 
tion (for a given set of parameters) may be generated by 
restricting service levels to be less than that attained 
by the BSM optimum. The resulting staff mix and budget 
required to support the mix for each such problem can 
assist the administrator in recognizing a suboptimal situ- 
ation in the existing facility staffing pattern. In 
addition, similar information may be helpful in planning 
a budget cut, since it would be easy to see the effect on 
total service and the objective function resulting from 
a decrease in the required budget. In this instance we 
simply interpret the result of the first subsidiary 
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problem in a different fashion. Finally, by relaxing 
the budget constraint and solving with alternative ser- 
vice levels, the effect of more funds on staff mix and 
the objective function may be examined. This allows for 
the consideration of the cost of providing increased 
care. A study of the results of the second area of con- 
cern allows the administrator to directly assess the 
impact of budgetary alterations. Both the service level 
and budgetary approach can provide valuable information 
for the specification of alternative courses of action, 
depending upon the orientation desired. 

In order to obtain information with respect to 
the service level, the Basic Staffing Model is modified 
so that we may explicitly restrict the service level to 
be less than or equal to a constant, S. The resulting 
model, called Model I, is given as follows: 
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s.t. £ s i n i f Bu ^get 



i 




V i 






(III. 9) 



£ n^ £ IK and integer, V i 




Vi, j , k 



L 



jk - Pjk - U jk 



V j ,k 



All quantities in Model I are as in the BSM. The con- 
straint (III. 9) is the necessary change. Obviously, we 
might solve Model I for each value of S, or budget level, 
of interest, such a tactic necessitating the solution of 
several mixed-integer linear programs. Obtaining solu- 
tions in this manner, however, can be a costly and time- 
consuming process. For this reason, we shall develop a 
methodology that produces solutions much more efficient- 
ly by exploiting the structure of the model. We defer 
consideration of this development to Chapter V. 



information related to the final two areas of concern. 
Recall that the demand for nursing services for each care 
area and patient classification is essentially 



We next turn to the consideration of obtaining 
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determined by the model with ^ ^ a variable. Attached 

to each however, are upper and lower bounds deter- 

mined through priority related considerations. For a 
given mix of Int. B, Int. A and SNF patients determined 
by classification , the bounds on each of the three sets 
of care areas (indexed by k) are multiplied by the re- 
spective number of patients in each category. Thus, the 
resulting bounds yield, in the aggregate, the maximum 
and minimum demands for each j,k combination. Obviously, 
then, the third area of concern may be modeled by con- 
sidering alterations in all or a subset of these bound- 
ing restrictions, given in the BSM as (III. 8) 

On the other hand, the final area of concern is 
usually related to the alterations in the minimum re- 
quired number of personnel of various skill levels. Such 
changes may be incorporated by appropriate adjustments 
in the constraint set (III. 6) of the BSM. Combining both 
of these concepts, which require the examination of 
alternative configurations of upper and lower bounds, we 
propose Model II as a reasonable characterization: 



88 
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3 k J 



(III. 10) 



L.+e(L.*-L.Un.iU.M(U.*-U. ) and 

l l l—i—i l l 



(III .11) 



integer , V i 




e,*€[o,il 

In constraint sets (III. 11) and (III. 12) the upper and 



chosen so that in varying 0 and/or on the interval 
[0,1] the desired range of consideration for the respec- 
tive bounds is achieved. Constraint (III. 9) of Model I 
has again been included as (III. 10) in this most general 
form of Model II. In this instance, we may fix S a priori 
and examine alterations in bounds at any given service 
level . 

In an analogous manner to that for Model I, the 
parameters 0 and <A could be fixed at values of interest, 
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and the resulting mixed-integer linear programs solved. 
Again, however, such a method would be tedious, necessi- 
tating a search for a more practical means of solution. 
The derivation of an algorithm for efficient solution 
of problems in the form of Model II will also be under- 
taken in Chapter V. 

In summary, the extensions to the BSM incorpor- 
ated into Model I and Model II capture the essence of 
the administrator's problem. Assuming that both models 
may be ef f iciciently solved, we have indicated how vari- 
ous interpretations of the results can produce valuable 
information both for the administrator and the director 
of nursing. In fact, the BSM need never be solved ex- 
plicitly, its results being easily obtained, for example, 
as the initial problem in either Model I or Model II. 

An example of how the entire unified approach to staffing 
and the aggregate budget process, beginning with patient 
classification and ending with examination of the results 
of the two models, will be given in Chapter VI. 

III. 6 Data Requirements and Availability 

In order to properly implement Models I and II, 
data for the following parameters are required: 

1. Appropriateness measures, c. ; 



2. Availability constants, a^; 
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Priority-related upper and lower bounds on 



4 . 

5 

6 
7. 



demand for nursing service, and U^; 

Priority indicators, Pj^»* 

Upper and lower bounds on the staff mix by 
skill level, L. and U.; 

l l 

Salary costs by skill level, s^; 



Personnel budget allowance. 

Since items 5, 6, and 7 are likely to be highly depen- 
dent on local conditions and regulations we will not 
discuss them further here. We shall, however, indicate 
sample values for these parameters in connection with an 
example of the methodology to be presented in Chapter VI. 

The appropriateness measures, c.^^, must be de- 
termined for every combination of skill level i perform- 
ing the activities of care area j for patients of 
classification k. Obviously, such information is likely 
to be highly subjective, depending primarily on any given 
individual's perception of the appropriateness criterion. 
Having chosen to examine the issue of quality of care 
within our models, the appropriateness measures should 
reflect the relative efficiencies of assigning nurses 
with various degrees of professional expertise to the 
care of patients whose needs may differ considerably. 
Patient needs are obviously a function of both the over- 
all degree of dependence (as captured in the 



91 



classification) and the specific care area in question. 
The concept of effective assignment subsumes the basic 
assumption that assignment of underqualified and over- 
qualified personnel with respect to a particular care 
area/classification combination is to be avoided. 

Strictly speaking, the quantification of sub- 
jective perceptions of appropriateness should be execut- 
ed so that interval-scaled c. .. are obtained for use in 

ljk 

our models. One such technique applicable in this in- 
stance is the Q-sort method demonstrated by Whiting [79]. 
This system essentially requires an evaluator to rank 
order the combinations of personnel and procedure from 
most appropriate to least appropriate by placing speci- 
fied numbers of assignments into several discrete cate- 
gories. The choice of intervals and required numbers 
per interval can lead to a discretized normal distribu- 
tion, from which the interval ranking data are obtained. 
Liebman [38,39] applied this technique in deriving 
appropriateness measures for her staffing models, addi- 
tionally comparing several transformations of the inter- 
val data for their relative properties. It was found 
that the set of ordinally scaled appropriateness coef- 
ficients produced results which were more in keeping 
with the evaluator's original perceptions of effective- 
ness than results obtained using other transformations. 
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Consequently, Liebman recommended the use of ordinally 
scaled preference measures in future staffing studies. 

In order to derive such ordinal data for our 
models, the skill level preferences as reported by 
McKnight and Steorts [43] were utilized. Developed by 
a panel of three professional nurses, preferences are 
reported by patient nursing need as reflected in the 
tri-level classification (minimum, moderate, maximum 
care) for 71 specific nursing tasks. First and second 
choices of skill level usually are given. For purposes 
of completeness, the skill level category definitions 
which guided the panel are given below: 

RN : Registered Professional Nurse - one who has 

successfully completed a program in an 
approved school of nursing and who is 
licensed to practice nursing as a registered 
nurse in the state in which she is employed. 

LPN: Licensed Practical Nurse - (a) one who is 

a graduate of an approved school of prac- 
tical nursing and is duly licensed under the 
provisions of the Practical Nurse Act of the 
state in which she is employed; (b) one who 
has received a license by waiver or examina- 
tion as provided for in the Practical Nurse 
Act of the state in which she is employed. 



93 



NA: Nursing Assistant - one who is employed as 

an "auxiliary personnel" in the nursing 
service of a health facility to assist the 
nurse. These persons are employed and 
trained to perform tasks which involve 
specified services for patients as delegat- 
ed by the professional nurse and performed 
under the supervision of a professional 
nurse or licensed practical nurse. 

In order that compatibility between these results and 
results by care areas might be obtained, the tasks sub- 
sumed by each care area were identified, and the prefer- 
ences for each task examined collectively. Although in 
most cases the overall care area preference structure 
was apparent, professional judgment* was employed to 
indicate the structure for those cases that had no readi 
ly apparent solution. Scoring a first preference as 2 
and a second preference as 1, the resultant appropri- 
ateness measures are shown in Table III. 7. 

The second set of parameters of the staffing 
models which must be obtained are the availability con- 
stants, a^. The manner of presentation of these quanti- 
ties is certainly dependent on the time frame and employ 
ment status (full or part time) chosen for consideration 
Recall, however, that we shall assume a 24-hour time 



* Consultation with Eleanor McKnight, R.N., 



M.P.H. 
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Table III. 7 

Ordinal Skill Level Preferences by Care Area and 
Nursing Need, Based on the Study of McKnight and Steorts [43] 



Care Area 


Minimum 


Moderate 


Maximum 


1. Elimination 


NA-2 


NA-2 


LPN-2 




LPN-l 


LPN-l 


RN-1 


2. Catheterization 


RN-2 


RN-2 


RN-2 


3. Daily Care 


NA-2 


NA-2 


RN-2 




LPN-l 


LPN-l 


LPN-l 


4. Bathing 


NA-2 


NA-2 


RN-2 




LPN-l 


LPN-l 


LPN-l 


5. Bedmaking 


NA-2 


NA-2 


RN-2 




LPN-l 


LPN-l 


LPN-l 


6. Care of Skin 


NA-2 


LPN-2 


RN-2 




LPN-l 


RN-1 


LPN-l 


7. Care of Hair and Face 


NA-2 


NA-2 


NA-2 




LPN-l 


LPN-l 


LPN-l 


8. Oral Hygiene 


NA-2 


NA-2 


RN-2 




LPN-l 


LPN-l 


LPN-l 


9 . Care of Nails 


LPN-2 


LPN-2 


LPN-2 




NA-1 


NA-1 


NA-1 


10. Rehabilitation and 


NA-2 


NA-2 


RN-2 


Recreation 


LPN-l 


LPN-l 


LPN-l 


11. Food and Nourishment 


NA-2 


NA-2 


NA-2 


12. Medications 


RN-2 


RN-2 


RN-2 




LPN-l 


LPN-l 


LPN-l 


13. Cardinal Symptoms 


LPN-2 


LPN-2 


LPN-2 




RN-1 


RN-1 


RN-1 


14. Therapeutic Measures 


RN-2 


RN-2 


RN-2 




LPN-l 


LPN-l 


LPN-l 


15. Urine Tests 


LPN-2 


LPN-2 


LPN-2 




RN-1 


RN-1 


RN-1 


16. Enema 


NA-2 


NA-2 


LPN-2 




LPN-l 


LPN-l 


RN-1 


17. Care of Colostomy or 


LPN-2 


LPN-2 


LPN-2 


I leostomy 


RN-1 


RN-1 


RN-1 


18. Associated Nursing 


RN-2 


RN-2 


RN-2 


Care Functions 


LPN-l 


LPN-l 


LPN-l 


19. Weighing Patient 


NA-2 


NA-2 


RN-2 




LPN-l 


LPN-l 


LPN-l 


20. Gavaging Patient 


RN-2 


RN-2 


RN-2 


21. Female Procedures 


NA-2 


NA-2 


LPN-2 




LPN-l 


LPN-l 


NA-1 


22. Oxygen Therapy 


RN-2 


RN-2 


RN-2 


23. Care of Critically 111 


RN-2 


RN-2 


RN-2 




LPN-l 


LPN-l 


LPN-l 


24. Death of Patient 


NA-2 


NA-2 


NA-2 


25. Miscellaneous 


NA-2 


NA-2 


NA-2 
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span for the model with no part-time personnel allowed. 
The models then determine the number of nurses, by 
skill level, working an 8-hour shift, with shift specifi- 
cation for individuals being left to the discretion of 
the director of nursing. We would like to know, there- 
fore, the portion of a shift, averaged over all three 
shifts, that personnel of each skill level usually 
have available to devote to patient-centered activities. 

An indication of these availabilities was ob- 
tained by examination of the results of a nursing work 
sampling study concurrently undertaken by McKnight [42] 
in connection with her Colorado work. Data were obtained 
in four homes of various sizes related to the partition- 
ing of nursing time into five mutually exclusive activi- 
ty areas: 

1. Patient centered; 

2. Personnel centered (training, staff develop- 
ment) ; 

3. Unit centered (environment, supplies, etc.); 

4. Home centered; 

5. Other centered (personal, standby, relief). 
Patient-centered activities being more or less analogous 
to those tasks subsumed by the 25 care areas, the 
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percentage of time, averaged over all shifts, for these 
activities was obtained. When considering these data 
categorized by nursing skill level, both charge and 
staff RN 1 s and LPN ' s were included. The resulting per- 
centages and availabilities per eight-hour shift are 
shown in Table III. 8. Two factors were noted in the 

Table III. 8 



Percentages of Time (Averaged Over Three Shifts) and 
Number of Hours Available Per 8-Hour Shift for 
Patient-Centered Activities 



Skill Level 


Percentage 


Ava i lability/ 8 -hour s 


RN 


69.9 


5.6 


LPN 


72.1 


5.7 


NA 


56.6 


4.5 


derivation of 


these results. 


First, home size (as indi 



cated by the number of units) had an obvious effect on 
the distribution of professional nursing time. That is 
in the smaller facilities professional nurses typically 
gave more time to patient-centered activities and less 
to supervisory work than in the larger facilities. 
Second, a significant portion of the non-professional 
nurses' time was being given to unit and other centered 
actitivies . 
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We next consider the upper and lower bounds on 
the demand variables 0 . Recall that in addition to 

r D k 

allowing flexibility in meetings patient demands, the 
limits are set with consideration being given to the pri- 
ority of certain care areas over others within and be- 
tween patient classification levels. Using the average 
daily required performance time data shown in Table 
III. 6, professional assistance* was again enlisted to 
derive a sample set of limits. Adjustments in these 
bounds may be made by utilizing the properties of Model 
II, but the bounds proposed at least serve as a basis 
for examination of adjusted estimates. 

One factor in attempting to consider priorities 
as related to the bounds was the number of patients, by 
classification, who actually received care in connection 
with the activities of each care area. Assuming the 
distribution of activities among this patient sample is 
fairly representative of the distribution among patients 
of similar nursing needs, this input more or less re- 
flects the priorities existing on the units at the time 
of the original study. Taken together with more norma- 
tive concepts of priorities of care, these considerations 
formed part of the basis for setting bounds. 



* Consultation with Eleanor McKnight, R.N., M.P.H. 
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Table III. 9 below indicates the percentage of 
patients of each category receiving care and the upper 
and lower bounds for each care area. Note that informa- 
tion for care areas 20 and 24 is omitted due to lack of 
observed activities. For other care area/classification 
combinations not observed in the original study but for 
which partial information did exist, an estimate was 
made . 



By abstracting the information contained in Table 
III. 9 the average and extreme daily care times by patient 
classification may be derived. These results are pre- 
sented in Table III. 10. As can be seen, the low and 
average times are nearly equal, with a definite tendency 
towards allowing for greater staffing requirements. 

Finally, we present an indication of the priority 
structure for care area/patient classification combina- 
tions in the last column of Table III. 9. Recall that 
= 1 indicates a relatively high priority, and 
Pj^ = 0 a lower relative priority. 
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Table III. 9 



Upper and Lower 


Bound 


on 6., (in minutes), 
i k 


Percentage 


of 




Patients in Each Category Receiving Care, 


and Priority 




Constants p.. 


(N . =64, N ,= 
min mod 


95, N =36) 
, max 






Care Area 


Class . 


%Patients 


Average Lower 


Upper 


Pjk 


1. Elimination 


min 


14 


.53 


.33 


.66 


0 




mod 


60 


4.94 


3.75 


5.50 


0 




max 


72 


8.24 


9.00 


9.50 


1 


2. Catheterization 


min 


~T~ 


.02 


.01 


. U5 


cr~ 




mod 


6 


1.0 


.75 


1.50 


0 




max 


33 


5.84 


5.50 


7.00 


i 


3. Daily Care 


min 


89 


2.36 


2.25 


3.50 


i 




mod 


72 


23.23 


23.00 


26.00 


i 




max 


89 


43.79 


40.00 


45.00 


0 


4. Bathing 


min 


11 


2.59 


2.25 


3.25 


1 




mod 


21 


8.09 


7.50 


9.00 


1 




max 


31 


13.94 


13.50 


17.00 


1 


5. Bedmaking 


min 


55 


3.46 


3.00 


4.00 


0 




mod 


81 


6.11 


5.75 


7.50 


0 




max 


69 


6.41 


6.00 


8.50 


1 


6. Care of Skin 


min 


2 


.05 


.03 


.06 


0 




mod 


25 


1.79 


1.50 


2.75 


0 




max 


78 


11.10 


11.00 


15.00 


1 


7. Care of Hair 


min 


28 


.75 


.60 


.85 


0 


and Face 


mod 


79 


2.89 


2.25 


4.00 


0 




max 


94 


3.02 


3.00 


5.00 


1 


8. Oral Hygiene 


min 


9 


.02 


.01 


.03 


0 




mod 


29 


.58 


.50 


1.00 


0 




max 


56 


1.20 


1.00 


2.50 


0 


9. Care of Nails 


min 


9 


.13 


.10 


.15 


0 




mod 


17 


.35 


.30 


.45 


0 




max 


19 


.35 


.30 


.75 


0 


10. Rehabilitation 


min 


53 


1.31 


1.25 


2.50 


1 


and 


mod 


69 


4.61 


4.50 


7.50 


1 


Recreation 


max 


69 


6.68 


6.50 


10.00 


1 


11. Food and 


min 


98 


8.56 


7.50 


9.50 


0 


Nourishment 


mod 


100 


17.35 


16.50 


20.00 


0 




max 


100 


36.32 


35.00 


45.00 


0 


12. Medications 


min 


91 


5.00 


5.00 


7.50 


1 




mod 


88 


9.27 


8.50 


10.00 


0 




max 


82 


10.45 


10.00 


14.00 


1 


13. Cardinal 


min 


8 


.61 


.50 


.75 


0 


Symptoms 


mod 


17 


.90 


.75 


1.75 


0 




max 


28 


2.90 


2.50 


4.50 


0 


14. Therapeutic 


min 


3 


.04 


.03 


.06 


o 


Measures 


mod 


8 


.44 


.40 


.75 


0 




max 


11 


1.92 


1.50 


3.00 


0 


15. Urine Tests 


min 


2 


.17 


.15 


.30 


0 




mod 


8 


.49 


.40 


.75 


0 




max 


8 


.18 


.15 


.50 


0 



Table III. 9, Cont. 



Care Area 


Class. % 


Patients 


Average 


Lower 


Upper 


Pjk 


16. Enema 


min 


2 


.12 


.10 


.25 


0 




mod 


8 


1.00 


.75 


1.50 


0 




max 


17 


1.00 


.75 


2.00 


0 


17. Care of 


min 


0 


0 


0 


0 


0 


Colostomy or 


mod 


1 


.21 


1.25 


4.25 


0 


Ileostomy 


max 


0 


0 


1.25 


4.25 


0 


18. Associated 


min 


100 


4.13 


3.25 


5.25 


u 


Nursing Care 


mod 


100 


6.17 


5.75 


7.25 


0 


Functions 


max 


100 


6.21 


6.00 


8.50 


0 


19. Weighing Patient 


min 


9 


.04 


.03 


.05 


O' 




mod 


5 


.04 


.03 


.05 


0 




max 


0 


0 


.05 


.10 


0 


20. Gavaging Patient 


- Omitted 












21. Female Procedures 


min 


0 


0 


.01 


.04 


^ 0 




mod 


1 


.02 


.01 


.04 


0 




max 


0 


0 


.01 


.04 


0 


22. Oxygen Therapy 


min 


2 


.12 


.05 


.15 


0 




mod 


2 


.09 


.07 


.10 


0 




max 


6 


.16 


.15 


.30 


1 


23. Care of 


min 


2 


.20 


.10 


.25 


0 


Critically 111 


mod 


1 


.11 


.10 


.30 


1 




max 


0 


0 


.15 


.50 


1 


24. Death of Patient 


- Omitted 












25. Miscellaneous 


min 


loo 


5.05 


4.50 


5.75 


0 




mod 


100 


5.50 


5.00 


6.50 


1 




max 


100 


6.50 


6.50 


7.50 


1 



Table III. 10 

Average and Extreme Daily (24-hour) Patient Staffing 
Requirements (in hours) by Patient Classification 



Patient Classification 


Lower Bound 


Average 


Upper Bound 


Minimum 


.52 


.61 


.75 


Moderate 


1.49 


1.55 


1.99 


Maximum 


2.65 


2.77 


3.51 



Chapter IV. Patient Classification for Long-Term Care 
IV. 1 Introduction 

"...Patterns are the means by which we interpret 

the world." (Meisel [45, p. 1]) 

Having explored in general terms in the forego- 
ing chapter the question of patient classification in the 
long-term care setting, we now move to a more detailed 
treatment of the topic. First and foremost, we seek to 
demonstrate a methodology that will be attractive compu- 
tationally both for analytical purposes and for future 
implementation. The application of such a method to the 
best data currently available and the judicious use of 
the results obtained will lead us to a system which meets 
the criteria for an acceptable patient classification 
system established earlier. Recall that the essence of 
these guidelines was to categorize patients based on an 
assessment of their individual medical, psycho-social, 
and functioning status, and then to relate the classi- 
fication to demand for nursing services. As an addendum 
to the above, the classification system sought should be 
applicable in as efficient and least time-consuming a 
manner as possible. Having previously demonstrated how 
any tri-level system may be related to demand , we 
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concentrate here on derivation of a specific classifi- 
cation methodology. 

The results of previous studies discussed in 
Chapter II are, for the most part, lacking with respect 
to our criteria. Most are obviously unsuitable for re- 
lating the aforementioned factors to a prescribed level 
of care and thence to demand for service. In those 
studies where the relationships could be inferred, either 
new data have subsequently been obtained or the opera- 
tional characteristics of the method with respect to 
facility level use were not explored in depth. None of 
this is said by way of criticism of work that has pre- 
ceded this study; rather, it tends to demonstrate that 
no classification system will be universally acclaimed 
for all possible uses, and so we proceed. 

Initially, some general ideas on the generic 
class of problems known as pattern recognition will be 
discussed, followed by a brief overview of specific tech- 
niques applicable in solving such problems. After demon- 
strating the derivation of an existing technique due to 
Walker and Duncan [78] and a discussion of its pertinence 
to the current study, application of the method to the 
data will be undertaken, and a readily applied procedure 
will be proposed. Comparisons with past studies and 
some recommendations for future research will conclude 
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the chapter. 

IV. 2 The Pattern Recognition Problem 

Researchers in the behavioral, social, and 
physical sciences are often confronted with large amounts 
of data which are typically multivariate in nature. 
Fisher, Kronmal, and Diehr [23] point out that the medi- 
cal sciences also tend to generate information of this 
type, characteristically even more multivariate due to 
the nature of the field. All of these groups share the 
common goals of gaining an understanding of the underly- 
ing mechanisms of the animate or inanimate system which 
produced the data and becoming adept at analyzing new 
data in terms of historical precedents. Both goals are 
the province of the so-called pattern recognition prob- 
lem. 

In an excellent treatment of the subject of 
pattern recognition, Meisel [45] points out that the 
major aspects of any such study are the detection of 
possible regularities in the data and the synthesis of 
such information as may be available into a decision rule 
to be used in analyzing future data. This is the "recog- 
nition" aspect of the problem. Note, however, that the 
application of the rule need not only occur e>c post 
facto , but may be made an integral part of the 
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derivation of the rule itself. This latter contingen- 
cy is called "learning and recognition concurrently," 
the former being simply, "learning before recognition." 
Both cases are shown schematically in Figures IV . 1 and 
IV. 2 which have been reproduced from Meisel [45, p. 6]. 



Figure IV. 1 



Learning Before Recognition 




Figure IV . 2 

Learning and Recognition Concurrently 




The concepts of learning and recognition are identified 
by Sebestyen [64, p. 5] as being the key points of pat- 
tern recognition. The former is said to be, "...the 
estimation of the probability densities describing the 
distribution of the set of samples in the vector space, 
while the latter is, "...based on the evaluation of the 
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already learned conditional probability density of each 
class at the point in the vector space that represents 
the new input to be classified." 

Meisel goes on to point out, however, that 
typically the decision rule obtained via either of the 
above schemes is of academic interest only, primarily 
because the dimensionality of the problem (corresponding 
to the number of variables under consideration) is too 
large for practical usage. Certainly the advent of the 
digital computer has made work in such dimensions tract- 
able, but reduction of the data to a lower dimensionality 
is a pleasing concept from a computational as well as 
implementational point of view. This is especially true 
when the cost of obtaining information for further study 
or installation of the results is high. Some definitions 
will assist us in understanding the concepts presented. 
Definitions [45] 

Measurement Space: The space with dimensionality 

corresponding to the number of variables under 
consideration initially. 

Pattern Space: A finite-dimensional space of 

relatively low dimension compared to the 
measurement space. The corresponding varia- 
bles contain sufficient information to per- 
form the classification process. 
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Feature Selection: (Preprocessing): The deter- 

mination of the components of the pattern 
space . 

Pattern Classification: The derivation of a 

decision rule to classify an unlabeled sample 
in the pattern space. 

Given the modified framework above, the pattern recogni- 
tion problem may now be viewed as a combination of 
feature selection and pattern classification. Figure IV. 3 
taken from Meisel [45, p. 7] shows in flow chart form the 
revised pattern recognition process. 

Figure IV. 3 

Stages in the Derivation of the Decision Rule 

[Physical Syste~m] 

. * . 

[Measurement Space] 

4 [Preprocessing] 

% t 

fPattern Space] 

t [Preprocessing 

ik , 

[Reduced Pattern Space! 

, —sic 

Decision Rule 



The process of feature selection is therefore 



central to the concept of pattern recognition. In terms 
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of the current study it is obvious that feature selec- 
tion will be an appropriate approach to solving the 
problems posed by the second of our criteria. Meisel 
points out several general attributes which an acceptable 
set of features must possess. These include the follow- 
ing : 

1. Low dimensionality; 

2. Sufficient information to yield recognition 
of patterns; 

3. Geometric consistency (preservation of order 
and distance concepts) ; 

4. Consistency of features throughout the 
samples . 

Even having determined a set or sets of features satis- 
fying Meisel' s criteria we have yet to solve the prob- 
lem, for there almost certainly exists a dominance of 
some feature sets over others in terms of how well they 
perform the ultimate pattern recognition task. The de- 
termination of the best feature set falls within the pur- 
view of the "optimal subset (feature) selection" problem 
and will be discussed in more detail later in this chap- 
ter . 

A final concept which Meisel discusses is the 
quantification of the decision rule obtained from our 
process. In general, two methods are available. The 
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first is the specification of a boundary function such 
as 

b(x) = 0 (IV.l) 

where x is taken to be the vector of features for an 
unlabeled point. The decision rule simply stated is to 
assign the point to group one if (IV.l) is non-negative 
or to group two if (IV.l) is non-positive. Points fall- 
ing on the boundary may be assigned arbitrarily. For N 
groups, the derivation of 

( IV . 2 ) 

such boundaries is shown to be sufficient. The second 
approach uses vector functions of the features, one for 
each of the groups, called p^ (x) , p^ (*) , . . . , p n (x). The 
rule is to classify the new point as a member of group 
i if 

P i (x) > Pj (x) , V j ^ i (IV. 3) 

This method is most often associated with the technique 
of discriminant analysis which will be discussed below. 

It will also be shown to be a more general concept 
applicable to a number of multivariate techniques. 

Before concluding the discussion of pattern 
recognition, a few additional words on the reduction of 
dimensionality problems are in order. Most of the 
multivariate statistical techniques in general use today. 
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such as principal components analysis, factor analysis, 
discriminant analysis, cluster analysis, and multiple 
linear regression embody the concept of feature selec- 
tion as either their primary aim or a secondary method. 
More detail on these methods will be provided in the 
following section, but for now it is sufficient to say 
that there exists a number of mechanical procedures to 
deal with the problem in an efficient computational 
manner. Fisher, Kronmal, and Diehr [23], however, em- 
phasize the point that, "Since the goal is to classify, 
and not to demonstrate methodology, one need not re- 
strict the reduction of data to objective methods" [23, 
p. 381] . They further state that, "Rather than depend 
upon the sheer computational power of a computer, all 
available scientific knowledge should be used to reduce 
the task" [23, p. 382] . In short, the authors argue 
that beyond the techniques mentioned above, the applica- 
tion of intuition, expert opinion, and experience gained 
in previous studies is often the best way to extract 
features. One caveat that must be attached to their 
argument, however, is to be wary of confusing the pre- 
dictive powers of a feature set with a causal relation- 
ship to the classification. Care must be taken that this 
point is understood by those from whom intuitive informa- 
tion is solicited. In any event, the use of informed 
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opinion is invaluable not only in direct feature ex- 
traction, but also in interpretation of results obtained 
by mechanical processes. 

IV. 3 A Review of Multivariate Statistical Techniques 

We have seen that the patient classification 
problem posed earlier is of the pattern recognition form, 
specifically requiring the steps of feature extraction 
and pattern classification. The several statistical 
techniques that lend themselves either directly or in- 
directly to the solution of the problem each have certain 
attributes that make them possible candidates for our 
use. However, the necessary assumptions and restric- 
tions each possesses should make us wary of using them 
indiscr iminantely , and for this reason we will examine 
each for its strengths and weaknesses vis-a-vis the cur- 
rent problem. 

The method of principal components analysis is 
an often-used approach to feature extraction. The ess- 
ence of the technique is the extraction from the set of 
original variables a subset of lower dimension that 

i 

nevertheless explains a significant proportion of the 
total variation in the data sample. Beginning with no 
distributional assumption on the vector of variables, x, 
Lawley and Maxwell [37] show that if we take the matrix . 
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A as the variance-covariance matrix of the sample with 
eigenvalues 

A r A p (iv. 4) 

where A is (p x p) , and if we also have the associated 
eigenvectors of A as 

/*±1 ' /i±2 ' ‘ * * ' ( IV . 5 ) 

such that 

A j= o Vi / j (IV. 6) 

^i^i = 1 

then the principal components of a new vector y can be 
written as 

M?-’' * * ' MpZ- 
Assuming that 

E(y) 

Var(y) = A 

then we may write 

P 

Y = % =£ z^. (IV. 9) 

i-l i=l ^ 

where the are uncorrelated, implying that the y^ can 
be represented as a weighted combination of orthogonal 
vectors yu.^. Noting that we can write 

U’AU =-A_ (IV. 10) 

where U is the matrix whose columns are the eigenvectors 
andy\.is a diagonal matrix withA^/ i=l,...,p on the 



( IV . 7 ) 



( IV . 8 ) 
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diagonal, and further assuming that theA^ have been 
ordered by decreasing magnitude, the following result 
is obtained. With 

P 

tr (A) = Z A • (IV. 11) 

i=l 1 

being a good characterization of the variance of y, then 

P k 

- £X,I k P (IV. 12) 

i=l 1 i=l 1 

small implies that the k corresponding uncorrelated 
variables 

z i =Aj y» i=l , 2 , . . . , k (IV. 13) 

explain most of the variation in the sample, and hence 
are a good set of features to use. If we are willing 
to make the distributional assumption of multivariate 
normality on x then tests of the hypothesis that the 
remaining p-k eigenvalues are essentially equal but non- 
zero may be made to enable the conclusion that the 
residual variation is split equally among the remaining 
variables. Although principal components analysis pro- 
vides a set of features it does not allow for the direct 
completion of the classification process, since no 
dependent variables are involved. It would, therefore, 
probably have to be coupled with another method (e.g., 
discriminant analysis) which would classify based on 
principal components only. Additionally, the method is 
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not invariant with respect to difference in scales for 
the variables of the sample, a drawback that presents 
difficulties in the case of the current study. 

A method that does not suffer from this last 
distraction is factor analysis, an extension of the con- 
cepts of principal components. According to Harman 
[29] , the classical factor analytic model may be written 
as 



z.=a..F.+a. 0 F.+ ... +a . F +d.U., j = l,...,n (IV. 14) 

j 3 1 1 3 2 2 jm m j j 

where n is the number of observed variables which are 

linearly described in terms of m (<n) common factors 

and a unique factor U . . The a . . are called factor load- 

3 3i 

ings. The method thus represents the observed variables 
in terms of a weighted linear combination of common latent 
variables (factors) of lesser dimensionality than the 
original space, plus a specific factor accounting for 
variability which cannot be attributed to the common 
factors. In matrix terms we may write (IV. 14) as 

z =_A.F + c (IV. 15) 

where A is the matrix of factor loadings, F is the vector 
of common factors, and €_ an error vector which here in- 
cludes both the specific variability and any unrelia- 
bility. Now obviously one must specify in advance the 
number of factors, k, believed to give the most reason- 
able form to (IV. 15), one possible approach being to 
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increase k until no appreciable alteration in the amount 
of variation explained is forthcoming. With this in 
mind, we can determine-A. in (IV. 15) from a form of 



where, according to Morrison [46] , Z is the variance- 
covariance matrix of the sample and Y is the diagonal 
matrix with the errors 6^ on the diagonal. Lawley and 
Maxwell [37] recommend the use of the method of maximum 
likelihood to determine A and Y . They go on to point 
out, however, that for k>l we may not specify the matrix 
A uniquely, and therein lies both the flexibility and 
difficulty with this technique. 



intuitively appealing interpretation in terms of the 
original variables, with the loadings representing the 
amount of variation of an original variable explained by 
the corresponding latent variable. To this end, and 
taking advantage of the non-uniqueness property of A , a 
series of transformations on A may be undertaken to 
yield a reasonable form. These transformations are 
called rotations and may be described mathematically by 



where is the appropriate rotation matrix. Natural- 

ly there is a certain amount of subjectivity in choos- 



Z = A A + Y 



(IV. 16) 



We would like the latent variables to allow an 




(IV. 17) 



ing R^ + ^, causing Harman [29] to speak of the art 
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involved in the method at this point. Some commonly 
accepted structures do exist, however, that seem reason- 
able from an interpretive standpoint. One of these is 
rotation to "positive manifold," which simply stated is 
the application of the linear transformation to make all 
elements of TV non-negative. The rationale is that if we 
choose to use the correlation matrix in place of Z in 
(IV. 16), then we would not expect the loadings generally 
to be negative in any reasonable A. A second possible 
transformation is rotation to "simple structure," where- 
in there are only a few non-zero elements in any row 
and column of A , and the pattern of loadings is signifi- 
cantly different for each factor. Comrey [17] points 
out that such a structure allows for more intuitive in- 
terpretation of the latent variables in terms of the 
original sample variables. A final type of transforma- 
tion is a procrustean rotation, wherein an attempt is 
made to force the loading matrix to fit some precon- 
ceived target form developed from a subjective estima- 
tion of the pattern of loadings. Numerous computational 
schemes have been devised to perform these transforma- 
tions, the major differences in them being the ultimate 
structure sought and the orthogonality or obliqueness of 
the factors themselves. 

Certainly factor analysis is a reasonable 
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technique for feature extraction, the features being the 
latent variables. The method is highly subjective, how- 
ever, and like principal components analysis does not 
lead directly to an operational classification without 
an intermediate application of another method. In gen- 
eral, no distributional assumptions are made with regard 
to the sample, but applications of some of the advanced 
techniques suggested by Lawley and Maxwell [37] for test- 
ing hypotheses concerning the number of factors do re- 
quire such an assumption. 

A third possible technique for use in the pattern 
recognition problem is cluster analysis, a method very 
popular in biological taxonomy for a number of years. 
Meisel [45] differentiates between clustering methods 
and pattern classification on the basis of the fact that 
the samples used in the former are unlabeled, the goal 
being to determine natural groupings within the sample 
that might correspond to natural groupings in the popu- 
lation. Tryon and Bailey [73] call cluster analysis a 
"poor man's factor analysis," its methods being based 
more upon simple logic than esoteric mathematics. The 
clustering of variables into groups has an analogous 
interpretation to that of factor analysis, in that the 
clusters roughly correspond to the latent variables and 
as such require interpretation. Anderberg [2] suggests 
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that after deciding on the data to be used and variables 
to be included, a choice of clustering data units or 
variables must be made, followed by the choice of simi- 
larity measures, clustering criteria, and the number of 
clusters desired. As in principal components analysis 
some care must be taken with the scaling of variables to 
avoid nonsensical results. A usual technique is 
standardization to zero mean and unit variance of all 
variables. At this point there is a wide divergence in 
the specific algorithms available, depending largely upon 
whether one wishes to cluster variables or data units. 
Initially, we determine a convenient similarity measure 
to use. In the case of clustering of variables, the cor- 
relation matrix is computationally attractive. It 
should be noted that the specific type of correlation 
used will be a function of the scaling of the variables 
as ratio, interval, ordinal, or nominal. Anderberg sug- 
gests methods by which diverse mixtures of scales can be 
made compatible. For data units, convenient similarity 
measures (which are the elements of a similarity matrix S) 
are Euclidean distance 

D(x,y) = [ j“ (x. .-y. .) 2 ] 1/2 (IV. 18) 

i=l 13 3 

where n is the number of data units, or the "city block" 



distance function 
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D (x,y) = 



n 



i=l 



x . . -y . . 
il il 



(IV. 19) 



both of which satisfy the necessary condition that the 
measure be a metric. Taking our lead from Anderberg, 
the major classes of algorithms are the hierarchical and 
nonhierarchical methods, the latter being useful only 
in clustering data units. Among the hierarchical 
methods, agglomerative and devisive techniques are the 
most prominent. Agglomerative algorithms move from 
clusters of one unit each to a final stage where one 
cluster of all n units is formed. Intermediate stages 
are identified by various numbers of clusters. The de- 
visive algorithms perform in the opposite fashion. For 
the sake of illustration, a basic agglomerative algor- 
ithm due to Anderberg [2, p. 133] is presented: 

Step 1: Begin with n clusters each consisting 

of exactly one entity. Let the clusters be 
labeled 1 through n. 

Step 2: Search the similarity matrix S for 

the most similar pair of clusters. Let the 

chosen clusters be labeled p and q and let 

their associated similarity be s , p>q. 

pq 

Step 3. Reduce the number of clusters by 1 

through merger of clusters p and q. Label 
the product of the merger q and update the 
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similarity matrix entries in order to reflect 
the revised similarities between cluster q 
and all other existing clusters. Delete the 
row and column of S pertaining to cluster p. 

Step 4. Perform steps 2 and 3 a total of n-1 

times (at which point all entitites will be 
in one cluster) . At each stage record the 
identity of the clusters which are merged 
and the value of the similarity between them 
in order to have a complete record of the 
results . 

A widely used variant of this algorithm chooses 
Euclidean distance as the similarity measure and the 
centroids of the new clusters formed as the points of 
reference. Devisive algorithms cause successive splits 
in the set of data units until n clusters of cardinality 
one remain. Little will be said about nonhierarchical 
methods, save for the fact that, in general, they allow 
for the specification in advance of the ultimate number 
of clusters desired. For additional insights into com- 
parative measures of performance and application of the 
various algorithms, the reader is referred to Gower [27] 
and Slagle, Chang, and Lee [69]. 

In retrospect, a cluster analysis on the varia- 
bles of the sample would lead to feature extraction. 
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while analysis on the data units would generate natural 
classes. Attempts to cluster data units corresponding 
to a sample of patient assessment records from the CPAI 
were made in connection with the current study. 

Although appealing, this approach was found to be unac- 
ceptable both from the standpoint of compatability of 
results and the cluster membership derived from the data 
analysis. Additionally, the problem of functionally re- 
lating the features to the classes remains as in the 
previous methodologies explored. Specifically, clear 
discrimination between the clusters obtained in terms 
of level of care of member units could not be obtained. 
This effectively rendered the results unusable for deter- 
mining classification rules. As was noted in Chapter 
II, Parker and Boyd [53] were able to achieve acceptable 
results using a superset of the data currently under 
study by adding the centroid scores of those variables 
found to be most important from the "B/T criterion" and 
relating the sums to cluster membership. A retrospec- 
tive study on the data using this rule is not reported, 
however. Attempts in connection with the current study 
to apply such a rule were found to yield erratic and 
unacceptable results. 

A fourth branch of statistical methodology for 
our consideration is discriminant analysis, by which one 
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may ascribe a prior dependent variable to correspond to 

the labeled classification of each data vector x and 

derive a linear function of x 

— / 

y = wx # ( IV . 20 ) 

The vector w is a weighting vector and must be deter- 
mined from the data. It can be shown in the dichotomous 
case that 

w = (S 2 ) -1 d (IV. 21) 

where d = x-^-x^ and S is the sample variance-covariance 
matrix will yield a function (IV. 20) such that the vari- 
ance between the two groups in question is maximized. 

Such a criterion is reasonable from the standpoint of 
choosing w such that maximum discrimination may be made 
between groups one and two. The method has extensions 
to the k group case where k-1 functions of the form 
(IV. 20) must be specified. The classification rule which 
may be derived from discriminant analysis in the general 
case is to assign the data unit, x, to group i if the 
value of (IV. 20) at x is closer to (IV. 20) evaluated at 
the centroid of group i than the value of (IV. 20) at the 
centroid of group j, j^i. 

Discriminant analysis thus provides an accept- 
able classification rule which may be readily applied in 
accordance with our second criterion. The question of 
feature selection is not so straightforward. One 
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approach is to consider the magnitude of the elements of 
w as being indicative of the effect each variable has on 
the ultimate classification. This tactic may only be 
useful up to that point at which there exists an inordi- 
nate number of variables with similar w. . How to choose 

1 

among them is still an open question, with the optimal 
subset selection problem also coming into play. An 
additional note on the method is that any ordering infor- 
mation concerning the dependent variables is ignored, 
thus eliminating its potential use. The results of 
Parker and Boyd [53] reported in Chapter II are never- 
theless an encouraging application of the method, albeit 
some 60 variables were used to effect the classification. 

The widely used techniques of multiple linear 
regression (MLR) have met with great success, under the 
appropriate conditions, in handling the problems of 
classification and feature selection. The basic model is 

Z = X£ + € (IV. 22) 

where y is the vector of dependent variables (n x 1) , X 
the data matrix [nx(p+l)], £ the vector of parameters to 
be estimated [ (p+1) xl] , and £ the error vector (n x 1) . 
The method of maximum likelihood can be applied to obtain 
estimates of called b such that 

b = (x'x) - 1 x'y , (IV. 23) 

the model (IV. 22) thus being fitted and providing a good 
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classification tool. The theory of MLR has been suf- 
ficiently advanced to allow us to make some reasonable 
attempts at feature selection once (IV. 23) has been ob- 
tained. Draper and Smith [22] point out four ways in 
which this may be done. In each case an attempt is made 
to find a smallest subset of variables without signifi- 
cantly degrading the predictive power of the model. The 
four methods are the following : 

1. Examine all possible combination of varia- 
bles; there exist 2 P such nominations in 
individual regression models. 

2. Backward elimination - Solve the model in p 
variables. Test the hypothesis that each b^ 
obtained is significantly different from 
zero using an F- or t-test. Eliminate those 
found to be insignificant, resolve and repeat 
until all coefficients are found to be sig- 
nificant . 

3. Forward selection - Test each variable before 

entering it into the model using a partial 

F-test to see if it accounts for a signifi- 

2 

cant amount of variation using R as the cri- 
terion. Stop when no such variables can be 
found . 

4. Stepwise regression - Similar to forward 
selection, except at each step of the process 
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all variables in the model are reexamined 
to see if they are still significant using 
the partial F-test, as well as performing 
the function in (3) . Variables which do 
not meet the level of significance are 
either dropped or not added as the case may 
be . 

With its predictive capabilities and selection 
possibilities, MLR is the most appealing method examined 
to this point. The assumptions of multivariate normality 
of the data can often be justified using Central Limit 
Theorem arguments for large sample sizes. The disturb- 
ing fact, however, is that the dependent variable is 
assumed to be a continuous , linear function of x, a 
supposition that is actually unwarranted in a situation 
where we are attempting to fit a truly polychotomous 
response. Although the ordering information on the de- 
pendent variable will now be important, something more 
than an ordinal relationship will be implicitly assumed. 
That is, typically we might assign the value 1 and 2 to 
the response variable for two groups, with the obvious 
ordinal relationship implied. But MLR takes such an 
assignment as an interval measure, when in actuality 
interval values (if they could be obtained) might be 1 
and 10, for example. Additionally, the assumed 
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continuity of y can often lead to "fuzzy" classifica- 
tions such as 1.4 between 1 and 2, in which case one is 
hard pressed to make the assignment rule mathematically 
tractable. We conclude, therefore, that MLR probably 
comes closest to a reasonable technique for our use, but 
its limitations for present purposes are serious enough 
to warrant further exploration. 

As a final point, we reiterate the Bayesian 
approach of Parker [52] for dichotomous independent 
variables. Recall that, from Bayes' Theorem, the basic 
classification rule is determined from 

P(s|d.)P(d.) 

P (d . I S) = — (IV. 24) 

1 1 — m 

P(S I d )P(d. ) 
i=l 1 1 

with d^, i=l,...,m, indicating the categories and S 
being the binary vector representing a patient profile 
on the n independent variables. Further, 

P(s|d i )=P(S 1 |d i ) • P(S 2 | S 1 0d i ) *. . . • P^n^n-l^ d i } • (IV * 25) 

The rule is to assign the patient with profile S to 
category i if 

P(djs) > P (d S) , j^i . (IV. 26) 

Parker went on to perform a feature selection procedure 
using two ad_ hoc measures that indicated the ability of 
the original variables to separate the groups, and his 
results were quite reasonable. The Bayesian approach is 



126 



certainly useful in capturing the discrete nature of the 
ordered dependent variable, and methods such as those 
proposed by Parker to deal with feature selection seem 
apropos. That the Bayesian approach could be extended 
for use with polychotomous independent variables is 
obvious, but the ordinal information in the variables 
would not be fully exploited, and the computations would 
be tedious though easy. In summary, the method seems 
best applied to binary x, a fact emphasized by Parker. 

IV. 4 The Method of Walker and Duncan 

The foregoing section serves to point out the 
basic attributes and deficiencies of the methods sur- 
veyed. A major point that could be gleaned from the 
summaries presented is that if the strengths of discri- 
minant analysis in the classification realm and the 
techniques of feature extraction developed in conjunc- 
tion with multiple linear regression could be melded 
into a single method embodying each, a satisfactory 
answer to the pattern recognition problem currently 
under consideration might be achieved. It is submitted 
that such a method is one developed by Walker and 
Duncan [78 ] . 

The essential feature of this method is the 
modeling of the polychotomous response variable in terms 
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of the logistic function 

P = (l+e“-'£-) “ 1 (IV. 27) 

where we immediately assume the multivariate case with 
x being the vector of observations of the independent 
variables and £ the vector of parameters to be estimated. 
Note that since we assume a polychotomous response, the 
model (IV. 27) is a reasonable representation of reality. 
The graph of p vs. x, given for illustrative purposes as 
Figure IV. 4, is a symmetric sigmoid curve with asymtotes 
at zero and one. This is entirely in keeping with the 
notion that the observed response is an "either-or" 
affair; the individual observed either belongs to a par- 
ticular category or does not. 



P 



I 



Figure IV. 4 
Logistic Function 
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The logistic function has a long history of use 



in the field of bioassay along with the integrated 



normal curve, f (At), where 



f(M) = 

y/2n 




exp (--j-Av ) d/c. , -«»£ x f c» (IV. 28) 



From (IV. 28) the probit, P, may be defined as the normal 
deviate. We shall not dwell on the bioassay applications 
of the logistic function, but further amplification is 
provided by Ashton [4] . The impetus for the Walker and 
Duncan work was an analysis of data arising from a ten- 
year retrospective study of the medical status of 5,000 
individuals. The objective was to determine the pre- 
dictive relationship between ten variables and the inci- 
dence of various degrees of coronary heart disease. In 
a later application of the method, Talbert [72] was suc- 
cessful in modeling the stagewise development of ova in 
vitro with a polychotomous response variable. The 
authors indicate a wide range of additional applications 
in a number of diverse fields. 



Cornfield [19], may be drawn between the form (IV. 27) and 
a particular derivation of the discriminant function by 
a Bayesian approach. Considering an individual to have 
a prior probability p of being in group 1 and a (1-p) 
probability of being in group 2; and if f^ (x) and f ^ (x! ) 



A highly interesting connection, demonstrated by 



1 29 



are the likelihood functions for the two groups, then 
Bayes' Theorem gives the posterior probability of being 



in group 2 as 

(1-p) f 2 (x) 

P 2 ” (1-p) f 2 (x) +pf 1 (x) 



(IV. 29) 



which yields 



1 

P 2 „ f, (x) 

1 1-p f 2 ( x) 



(IV. 30) 



Under the usual multivariate normal assumptions of dis- 
criminant analysis f^(x) divided by f 2 (x) gives 

fJTxT - = ex P(-y[/li 2 *i~*L 2 £ ^2 ]+ -" E 1 ( — 1~— 2 } } (IV. 31) 



implying that (IV. 30) is of the form 

P 2 = 1+exp [- (<+x'^_) ] (IV. 32) 

which is the same as (IV. 27). The major difference be- 
tween the discriminant function and the logistic function 
approach, as pointed out by Cox [20] , is that the latter 
is not so highly dependent on the distributional assump- 
tion except under some very special cases. He goes on to 
state that. 



If it is operationally meaningful to consider 
Prob (y=l) and Prob (y=0) as defined for each 
fixed x and having approximately the form [of a 
linearized version of (IV. 27) taking (p^/P 2 ) 1 > 
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and if y=l , 0 are the only two possibilities, 
the approach from [the linear form] seems 
preferable to that based on the model of two 
normal x- populations [discriminant analysis] , 
because it assumes less. This applies whether 
the analysis is made to interpret data, or 
whether the object is to 'classify' a new indi- 
vidual. [20, p. 64] 

Note that the logistic function (IV. 27) as given 
may not be easily transformed to a linear form. Such a 
nonlinear function is called "intrinsically nonlinear" 
by Draper and Smith [22] . This has led many authors to 
use the transform mentioned by Cox, namely 

Po 

In — = + x'fi (IV. 33) 

Pl - F 

often called the "log odds" or logit. The approach taken 
by Walker and Duncan is to use instead the function 
(IV. 27) directly via a Taylor Series first-order approxi- 
mation, then to demonstrate a method whereby the normal 
equations may be solved by a recursive procedure. This 
represents a departure from the usual methods of fitting 
nonlinear models, which are usually based either on 
iterative schemes to solve the linearized approximation 
to (IV. 23) or steepest descent algorithms that use local 
gradients as their basic device. The derivation that 
follows is for the case of a trichotomous response vari- 
able, since this is the model we will ultimately utilize, 
and is based on the derivation given in the original 



work . 
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We begin by assuming that with each of our N 
observations there is associated a response variable 
which may be captured as 

p^ = 0 if the observation does not fall into 
class 1, 

=1 if the observation falls into class 1, 

P 2 = 0 if the observation does not fall into 
class 2, 

=1 if the observation falls into class 2, 
p 3 = i - Pi - p 2 

=0 if the observation does not fall into 
class 3, 

=1 if the observation falls into class 3. 
Essentially we shall fit two functions simultaneously, 
corresponding to 

(IV. 34) 



p l=E(Pl) =f 1 =f («A 1 ,^,x) = (1+exp (-.e^-x '£_) 1 



and 



-1 



P 1 +P 2 =E (p 1 +p 2 ) =f 2 =f (<=< 2 ,£,x) = (1+exp (-^ 2 -x'^) ) (IV. 35) 
from which 



P 3 = 1 - (Pj+Pj) 



(IV. 36) 



can be determined. The assumption made is that 

p i + p 2 > P-l (IV. 37) 

which allows us to write f^ and f 2 with the same 
"slope," 6 , and illustrates that the ordinal nature of 
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the groups is important. That is, group 1 is at a 

lower level than group 2 , both of which are at a lower 

level than group 3. In the application that follows, 

this assumption is borne out by noting that a patient 

requiring Intermediate B level care requires less care 

than one needing Intermediate A care, both requiring 

less care than one needing Skilled Nursing care. The 

t h 

model for the n 1 " observation is therefore 



In 



Pin = f «i'*'2 n > +c ln 
+ ?2n = f( ”‘2'e-'*n ) + C 2n 



(IV. 38) 



We now take as a linear, first-order approximation to 
(IV. 38) the first two terms of the Taylor Series expan- 
sions of each, yielding 



(IV. 39) 



P ln =f ( *l'£'-n )+ TS~1 + TJ. 

S 1 £ 2 '£'-n ) + 2 f( ^2'&'-n ) g 

Pln +p 2n =f *2 ' f + — + 

where anc ^ £ are assume< ^ values of the respec- 

tive parameters. From (IV. 39) we derive so-called 
"working observations" y* n and y* n as 

<ff(* , ,£,x )- f (<* I x ) - 

. ., T - , . 1 * “ n 1. 1'Mf (IV. 40) 

y!n=Pln- £W l'l^n ) + ~ 






y 2n = Pln +p 2n- <fU 2'^^n )+ - 



& 2 



' + 
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The partials may be computed from (IV. 27) and found to 
be 



f 1 


Sf 2 _ n 


1 Pln ln 


' P 3n Q 3n 




(IV 




* f 2 


s pj P ln Q ln X jn ' 


jpj P 3n Q 3n X jn D" 1 ' 2 '---' 3 



where 

s = the number of variables plus two, 

P^ n and P^ n are as given by (IV. 34) and (IV. 36) 

respectively, and 

Q^ n and Q^ n are one minus p^ n and , respectively. 
It should be obvious at this point that since the indi- 
vidual equations of the model (IV. 38) are interrelated, 
some sort of weighting corresponding to a correlation 
must be given to the data matrix X. It can be shown 
that such a weight matrix will be the inverse of the 
diagonal matrix of variance and covariances of the 
errors, whose elements are the 2x2 matrices: 



w = V 1 (tf ) 
n — n 



2n 



Q 



3n 



A 

P 



In 



-1 

In 



Q 



-1 



3n 



(IV. 42) 
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We may now specify the model in terms of the working 
observations y* as 



where 



y* = AX0 + € 





( IV . 4 3 ) 



(IV. 44) 



( IV . 4 5 ) 



1— 1 


0 


X t i X . « • • • X . 

11 12 Is 


0 


1 


X 11 x 12 * * * x ls 



i— 1 


0 


X N1 X N2 * * * X Ns 


0 


i— 1 


X N1 X N2 * * * X Ns 



(IV. 46) 
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<s = 





(IV. 47) 



A = 



A 0 

A 2 

0 ‘A 



n 



where 



A = 
n 



P. Q, 
In In 

0 



p^Q 



3n 3n 



( IV . 4 8 ) 



, n=l , 2 , . . . ,N 



( IV . 4 9 ) 



in accordance with the form of (IV. 40). The analogous 
equation to (IV. 23) is therefore 



A -1 

9 = (X'AWAX) AW'Wy* , 



and we may also write 



V(0) = (X'AWAX) 



-1 



(IV. 50) 



(IV. 51) 



The derivation to this point has brought us to the form 
(IV. 50) which, as was mentioned earlier, might be solved 
using an iterative algorithm. Walker and Duncan, how- 
ever, suggest an alternative procedure based on a recur- 

r\ 

sive scheme that updates estimates of b =0 and V at 
the consideration of each observation. Without proof, 
the appropriate relationships are 
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V =V , -V ,x A D 1 A x'V , 
n n-1 n-l-n n n n-n n-1 



(IV. 52) 



where 



D 



=A x ' V . x A +W 
n n -11 n-l-n n n 



(IV. 53) 



and 



b =b _+V . x A D 

— n -n-1 n-l-n n n 



-1 



Pi “ Pi 
In In 



Pin + p 2n ‘ ( hn +J 2n> 



(IV. 54) 



and where 

P ln and P 2n are (IV. 27) appropriately evalu- 
ated with the best current estimates of ^ being 
used . 

This essentially completes the derivation save for some 
computational considerations. 

First of all, it can be seen that the scope of 
one iteration in this method encompasses the ultimate 
use of all available data, with the estimates of hope- 
fully stabilizing within its course. A convergence 
check based on the final b and the initial guess b^ is 
made after each such iteration, a useful form being 



b. - b 
1 oi 






i 1,2, ...,s 



(IV. 55) 



l 

-4 

where & is a tolerance value, say 10 . Additionally, 

the authors demonstrate that at any stage of an itera- 
tion the effect of the initial guesses of b and V may 
be removed by application of the following, where k denotes 
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the observation at which the correction takes place and 
starred quantities indicate the current values. 



V, = (V* 1 - V 1 ) 1 

k k o 



*k - V k (V k'^k- V c 1 ^o ) 



(IV. 56) 



As a final computational feature, an initial centering 
of the data by subtraction of the arithmetic mean of each 
variable assists in obtaining orthogonality between the 
estimates of ^ andc^* Presumably, if strong colineari- 
ty among the variables is suspected, methods which yield 
orthogonal columns of the data matrix X could be applied; 
these were not attempted in this study. 

Upon fitting the model (IV. 38) by obtaining 
estimates «<* , * , and we would naturally desire to 

have a classification rule and some measures of the 
effectiveness of the model which are either self-contained 
or rule-dependent. Towards the end of deriving the 
rule, note that the following posterior probabilities of 
an individual x n being in each of the three groups may be 



computed after fitting the model: 

P ln = (1 + exp(-%|* - x^*))- 1 (IV. 57) 

p*n= ( 1 + ex P ( -«(*-x ^*) ) ‘ 1 - (l+expt-j*-^*)) -1 (IV. 58) 

p* 3n = 1 - expW* - 



(IV. 59) 
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We therefore propose the following simple classifica- 
tion rule: 

For an individual represented by the vector of 

observations x^, classify as being in group i if 
p* > p* , for all j^i, j=l,2,3 (IV. 60) 

where p£ n are given by (IV. 57), (IV. 58), (IV. 59). 

In order to check the merit of the classification 
rule, a classification analysis procedure suggested by 
Parker [52,53] may be utilized. This essentially in- 
volves the comparison of the prior and posterior classi- 
fications, individual by individual, agreement consti- 
tuting a "hit" while disagreement implies a "miss." Note 
that the prior classification is assumed to be 



MAX 



i— 1 ,2,3, 



l in 

and represents professional judgment as to a patient's 
appropriate level of care; and the posterior classifica- 
tion is 



MAX , i=l , 2 , 3 / 

i "in 

and represents the model's prediction of a patient's 
level of care. As a means of displaying the dispersion 
of misses, a classification matrix can be formed by 
entering in the appropriate cells the number of indi- 
viduals with the given comparison of prior and posterior 
classifications. Additional information available 
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includes both a measure of the prediction and recogni- 
tion powers of the model in terms of the prediction and 
recognition rates, respectively. The prediction rate 
gives the percentage of patients with prior classifica- 
tion i among all patients assigned by the model to 
classification i. The recognition rate, on the other 
hand, is the percentage of patients with prior classifi- 
cation i which was asserted by the model to have the same 
posterior classification. The form of the classifica- 
tion matrix is shown in Figure IV. 5. 



Figure IV. 5 

Classification Matrix as Suggested by Parker 



Posterior Classification 



12 3 

1 

Prior Classification 2 

3 

Prediction Rate: a/a+d+g e/b+e+h 



a 


b 


c 


d 


e 


f 


g 


h 


i 



Recognition Rate: 
a/a+b+c 
e/d+e+f 
i/g+h+i 
i/c+f +i 



As an added measure, the usual type of analysis 
of variance (ANOVA) may be performed. We based our calcu- 
lations on the linear approximations to the functions 
ftt*, x n , £_*) and f(A* 2 , , ^* ) provided by the working 
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observations (IV. 40) evaluated at * , oC | . We denote 
the (2N x 1) vector of such approximations as y* and the 
(2N x 1) vector of errors 

E= (y-y) (IV. 61) 

where 

y is simply the (2N x 1) vector of pairs consist- 
ing of the logistic functions evaluated at 

oi * , ft*, and x and d * , 6*, and x . 

1 r- — n Z V — n 

Recalling that p is the number of variables under con- 
sideration, we write the ANOVA in the following format: 

Figure IV. 6 

ANOVA for Logistic Model 
ANOVA (Uncorrected for Mean) 



Source 


Degrees of Freedom 


Sum of Squares 


Mean Square 


F Ratio 


Regression 


p+ X. 


|S*'XWy* 


SSR/df 


MSR/MSE 


Error 


IN - p - 2 


E' WE 


SSE/df 




Total 


IN-1 


£*'XWy*+E'WE 







Note that the ANOVA has been derived without a correction 
for the mean of y* because the concept of a mean element 
value is obscured by the pair-wise nature of the elements 
themselves. Nevertheless, the information available 
from Figure IV . 6 provides a good indication of explained 
variation. Observe, however, that due to the fact that a 
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linear approximation to the logistic is being fitted by 
our model, the vector y* is dependent upon the estimates 
derived and will not, in general, be invariant with re- 
spect to alterations in the number of variables, p, for 
example. This will have implications in computing the 
regression and total sums of squares, since in the ordi- 
nary linear regression case the quantity used is y, 
which is invariant. 

It is perhaps wise at this point to pause and 
emphasize a few general points concerning the use of re- 
gression-type models in classif icatory applications. The 
first such item was mentioned earlier but bears repeat- 
ing: the relationships we seek between the independent 

variables (regressors) and the dependent variable (re- 
sponse) are of a predictive nature versus a causative 
one. In general, the functional specifications of each 
of these relationships will differ, and this is often a 
source of confusion for the unwary user of the results. 

In terms of our attempt to classify long-term care 
patients this caveat implies that the outcomes of the 
study should not be viewed as relating those factors 
deemed to be of consequence as directly causing a patient, 
of necessity, to be placed in a particular level of care. 
Rather, the preferred interpretation is that for the 
data available the quantitatively salient factors were 
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able to explain a significant portion of the variation 
caused by attempting to obtain congruence between a very 
large (although finite) number of possible vectors denot- 
ing patient status and three levels of care. It is 
therefore assumed in all such studies that the state of 
the world with regard to long-term care patient status 
is fairly regular, thus allowing the derived model to be 
used for predictive purposes, with the added caution given 
by Draper and Smith [22, p. 241] to, "...restrict pre- 
diction to the region of x-space from which the original 
data were obtained . " 

An additional consideration in this regard con- 
cerns that portion of the variation mentioned above which 
the model cannot explain. A more general specification 
of (IV. 38) might be as follows: 

Pin = 9 (^ 1 '^ , -n ;0 l ,0 2' * * * (IV. 62) 

p ln + P 2n = g(* t 2'^'-n ;6 l ,0 2' * * * 
where parameters previously used are as before, but g 
denotes a general functional form which is also para- 
metric on 0 i=2,...,k which are unknown to us, and m 

is a vector of measurement errors in the regressors. A 
model such as (IV. 62) can be used to illustrate the 
assumption that the unexplained variation is due to 
factors which are obscured to the researcher and to 
measurement error. The model applied is based upon such 
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an assumption. Blalock [9] summarizes and extends these 
concepts in a concise fashion by noting that a form simi- 
lar to (IV. 62), but without m, is usually acceptable in 
cases where the relative magnitude of error due to un- 
known factors is assumed to be greater than that attribut- 
able to measurement error. He goes on to point out that 
such a form is also a reasonable specification for non- 
experimentally derived data; that is, data which are ob- 
tained without the benefit of experimental design to con- 
trol for effects. Methods for isolating the effects of 
measurement errors and errors due to unknown factors 
(when both are assumed to be present) are relatively 
undeveloped . 

IV. 5 The Variable Subset Selection Problem 

Near the outset of the chapter the question of 
feature extraction was emphasized as being an integral 
part of the pattern recognition process. Mention was 
also made of the problem involving the identification of 
the optimal set of features of size n, the so-called 
optimal subset selection problem. Some additional com- 
ments on each of these are in order. 

After having fitted the model using all of the 
appropriate and available data at our disposal, it be- 
comes necessary to reformulate our procedure somewhat so 
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that the classification function may be placed in a 
useful framework for the ultimate consumer. Usually 
this reformulation is characterized by a reduction in 
the number of original variables used to effect the model 
fit. There are two main reasons for this. First, if 
the marginal cost of obtaining each additional piece of 
data is relatively high it behooves us to present as low- 
cost a solution as can be obtained. The second consider- 
ation is that it is entirely possible that certain of 
the original variables might actually be obscuring the 
predictive power of the model by interacting in a nega- 
tive way with other variables. The elimination of such 
variables could therefore lead to an improved model if 
the prediction rate is the criterion used to choose be- 
tween alternatives. 

Theoretically, at least, we may postulate the 
existence of a subset of the variables of size n which 
would yield the best fit (in terms of some criterion) 
among all other subsets of size n. In addition, if we 
assume that the marginal cost of obtaining each variable 
subsequent to the first few is constant, then n could be 
interpreted to be the maximum number of variables that 
could be obtained within budgetary limits. The problem 
posed by the attempt to find such subsets is called an 
optimal subset selection problem, the criterion in the 
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more general situation usually being one determined from 
information theoretic concepts. 

For the case of ordinary multiple linear re- 
gression, Boyce, Farhi, and Weischedel [11] have proposed 
an enumerative algorithm which determines the variables 
constituting an optimal subset of size n based on the 
criterion of percentage of variance explained. This 
essentially combinatorial problem is solved by estab- 
lishing bounds to implicitly evaluate certain combina- 
tions of variables which are dominated by others. The 
search is bounded and exhaustive. Further, the authors 
claim that the method is superior to the feature selec- 
tion methods posed earlier in the chapter due to its 
ability to find the global optimum versus local optima. 
Although attractive, the algorithm does not appear to be 
readily applicable to the non-linear regression problem, 
primarily because of the dependence of the regression 
and total sums of squares on the estimated parameters via 
the vector y * . When considering problems of size n, for 
example, the total sum of squares would not be invari- 
ant, but would depend upon the fit itself and thus would 
lead to difficulties in establishing the necessary bound- 
ing relationships . 

In the absence of such refined methodology, we 
have attempted to establish a reasonable list of 
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candidate subsets for examination both by following the 
recommendations of Fisher, et a_l. [23] and invoking the 
technique of backward elimination from ordinary MLR. By 
doing so we will not, in all likelihood, find the opti- 
mal subsets of whatever sizes are chosen as being within 
reason, but as will be shown below the criteria do not 
appear to be highly sensitive. 

IV. 6 Results 

We turn now to the specific application of the 
methodology to the problem of patient classification. 
Recalling that we initially seek a recognition of pat- 
terns in the data as they relate to the response variable, 
and then to select features in order to devise an opera- 
tional classification system, we proceed as follows. 

Making use of the available data from the Col- 
laborative Patient Assessment Instrument (CPAI) which 
was discussed in Chapter III, the following modifications 
were made. First, the original sample of 623 patient 
records was reduced by 295 through elimination of the 
data obtained by the Harvard research group. The ration- 
ale for this action was twofold. Discussions with re- 
searchers in the study which produced the CPAI indicated 
that the criteria used by the experts in judging the 
appropriate level of care for the eliminated data were 
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different from those used by the other two groups. 

Parker [53] noted a difference between the Harvard data 
and the data from the other two groups. After applica- 
tion of discriminant analysis to the CPAI data with 
judgment of "appropriate level of care" categorized by 
research groups as the dependent variable , a high de- 
gree of group variability was found between the research 
groups. This phenomenon was not, however , as pro- 
nounced between the Hopkins and HANYS data. 

A further reduction of 22 individuals was made 
to eliminate patients deemed to have an appropriate care 
level of "chronic care" or patients with incomplete 
records. Finally, on the recommendation of professional 
persons the record sets of patients with an appropriate 
care level of "Domiciliary" and "Intermediate B" were 
merged since these patients are likely to be quite simi- 
lar with respect to medical, functional, and psycho- 
social status, other factors intervening in their ulti- 
mate level of care assignment. The final sample used in 
the analysis is depicted in Table IV. 1 

Turning our attention to the set of variables 
to be used, recall that Table III.l of Chapter III listed 
some 60 items included in the original CPAI, falling into 
the broad categories of socio-demographic data, function- 
ing status items, impairments, and medical status items. 
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Table IV. 1 

Derived Sample for Classification Study 

Appropriate Level of Care 
Int. B Int. A SNF Total 

Sample Size 66 110 130 306 

It will also be remembered that we are seeking a post- 
assessment patient classification system whose purpose 
is to assist in predicting the demand for the level of 
nursing services in the long-term care setting. Of equal 
import is the potential value of the classification pro- 
cess for the placement of patients into an appropriate 
level of care, noting that other factors will also, of 
necessity, be considered. Since the main thrust of both 
purposes is related to what may broadly be termed a 
patient's health status, it is obvious that certain of 
the CPAI items do not and should not be considered in 
our system. Such variables fall mainly under the cate- 
gory of socio-demographics and as such were eliminated 
from the study. Other variables eliminated were those 
found to have such a low incidence of variation they 
could be considered to be constant over all data, and 
those items that were missing in a significant propor- 
tion of the records. The results of other studies cited 
in Chapter II tend to indicate, with the exception of 
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the socio-demographic variables, that the variables 
eliminated should not be of major predictive value. 

Table IV. 2 lists those variables that were ultimately 
included. In terms of the broad categorization used 
earlier, we can see that items 1-15 are functioning status 
items, 16-25 impairment items, and 26-37 medical status 
items, thus representing a desirable set of variables 
for the stated purposes of our study. 

The method of Walker and Duncan described in 
Section 4 was coded in FORTRAN IV for execution on the 
DEC System 10. Using the appropriate level of care as 
the dependent variable with its three possible values of 
Intermediate B, Intermediate A, and Skilled Nursing 
Facility, a run was made using all 37 variables listed 
in Table IV. 2. After completion of the calculations, an 
ANOVA and classification analysis were performed. In 
the classification analysis, "assigned level of care" 
refers to the assignment made by the model, while "appro- 
priate level of care" refers to the judgments of pro- 
fessionals as to the level of care the patient should be 
receiving. Table IV. 3a shows the resulting estimates of 
and £ by variable number, as well as the ratio of 
(3^ to s^, where s^ is the sample standard deviation of 
the estimate. Table IV. 3b gives the ANOVA, and Table 
IV. 3c the classification analysis. As a further 
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Table IV. 2 

Variables Selected for Classification Study 



1. 


Mobility Level 


19. 


2. 


Transferring 


20. 


3. 


Walking 


21. 


4. 


Wheeling 


22. 


5. 


Stair Climbing 


23. 


6. 


Bathing 


24. 


7. 


Dressing 


25. 


8. 


Eating /Feeding 


26. 


9. 


Toileting 


27. 


10. 


Bowel Function 


28. 


11. 


Bladder Function 


29. 


12. 


Orientation 


30. 


13. 


Communication of Needs 


31. 


14. 


Behavior Pattern (Freq.) 


32. 


15. 


Behavior Pattern (Type) 


33. 


16. 


Sight Impairment 


34. 


17. 


Hearing Impairment 


35. 


18. 


Speech Impairment 


36. 



37. 



Fractures and Dislocations 

Joint Motion (Upper) 

Joint Motion (Lower) 

Joint Pain and Swelling 
Missing Limbs (Lower) 

Paralysis /Pares is 
Dentition 
Cigarette Smoking 
Anemia 

Angina and/or M.I. 

Arthritis 

Cardiac Arrhythmias 
Congestive Heart Failure 

Diabetes Mellitus 

Hypertension , Essential 
(Incl. Malignant Hypertension) 

Malignancy 

Mental Illness 

Neurological Disorder(s) 

Chronic Respiratory Disease 



amplification of Table IV. 3, note that the F-test for 
evidence of overall regression is highly significant 
with p<.001. Also, the overall recognition rate was 
67% while the overall prediction rate was about 70%. 
Notice, too, that all but about 5% of the misses occurred 
one step off the main diagonal, with a noticeable skew 
towards a higher level of assignment. The assumed model 
obviously had the most difficulty in predicting placement 
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Table IV. 3a 

Results of Application of Walker and Duncan Method for 37 Variables 



Variable 


6* 


6J/s. 


Variable 


6 i 


6*/s . 
l l 


“i 


2.413 




17 


0.125 


0.885 


i 

°2 


5.200 


-- 


18 


-0.153 


-0.884 


1 


-0.310 


-3. 100 


19 


-0.192 


-1.356 


2 


-0.127 


-0.693 


20 


-0.238 


-0.840 


3 


0.274 


0.584 


21 


-0.019 


-0.079 


4 


0. 120 


1.198 


22 


0.038 


0.170 


5 


-0.441 


-4.413 


23 


0.223 


1.287 


6 


-0.439 


-2.197 


24 


-0.133 


-0.664 


7 


-0.322 


-1.609 


25 


-0.170 


-1.200 


8 


-0.470 


-2.712 


26 


0.604 


2.134 


9 


0.358 


1.791 


27 


0.723 


1.417 


10 


-0.114 


-0.805 


28 


-0.345 


-0.629 


11 


-0.008 


-0.083 


29 


-0.297 


-0.939 


12 


-0. 197 


-0.657 


30 


0.472 


0.776 


13 


-0.048 


-0.127 


31 


-0. 104 


-0.259 


14 


-0.379 


-1.012 


32 


-0.424 


-1.134 


15 


0.618 


2.185 


33 


0.307 


0.852 


16 


-0.204 


-1.018 


34 


-0.279 


-0.595 








35 


-1.810 


-2.419 








36 


-0.370 


-1.169 








37 


-1.095 


-2.283 



• Table IV. 3b 

ANOVA for Application with 37 Variables 



Source 


df 


SS 


MS 


F 


Regression 


36 


3774.02 


104.83 


44.5 


Error 


269 


633.74 


2.36 





Total 



305 



4407.76 
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Table IV. 3c 

Classification Matrix for Application with 37 Variables 

Assigned Level of Care 









Int . B 


Int • A 


SNF 


Total 


Recognition Rate 




Int . 


B 


41 


21 


4 


66 


.621 


Appropriate 
Level of 
Care 


Int • 


A 


10 


69 


31 


110 


.627 


SNF 




1 


29 


100 


130 


.767 




Total 




52 


119 


135 


306 


.672 


Prediction 


Rate 




.778 


.580 


.740 


.700 





of "Intermediate A" patients. Some further comments on 
these results will be made later. 

Having seen the results made available by fit- 
ting the logistic function model for all variables under 
consideration, we now turn to the subset selection prob- 
lem. Using both the intuitive and statistical ration- 
ales explained earlier, a total of six subsets of the 
original 37 variables were derived and are listed in 
Table IV. 4. The first two of these sets were formed from 
results reported by Parker and Boyd [53] in the applica- 
tion of discriminant and cluster analysis, respectively. 
Variables which were included in those studies but not 
in the present one were naturally excluded, and some 
additions were made to the second subset based on the 
first. The next two sets were determined from an 
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Table IV. 4 

Subsets of the Original Variables Chosen for Analysis 



Subset I (Parker and Boyd Discriminant Analysis with modifications) 



1. 


Mobility Level 


7. 


Communication 


2. 


Wheeling 


8. 


Behavior (Type) 


3. 


Bathing 


9. 


Fractures and Dislocations 


4. 


Dressing 


10. 


Paralysis/Paresis 


5. 


Toileting 


11. 


Dentition 


6. 


Orientation 


12. 


Neurological Disorder(s) 


Subset II (Parker and Boyd Cluster Analysis with modifications) 


1. 


Mobility Level 


7. 


Bowel Function 


2. 


Wheeling 


8. 


Communication 


3. 


Bathing 


9. 


Anemia 


4. 


Dressing 


10. 


Mental Illness 


5. 


Eating/Feeding 


11. 


Neurological Disorder(s) 


6. 


Toileting 


12. 


Chronic Respiratory Disease 


Subset III (Cluster 


Analysis 


One) 


1. 


Mobility Level 


7. 


Toileting 


2. 


Transferring 


8. 


Bowel Function 


3. 


Walking 


9. 


Bladder Function 


4. 


Wheeling 


10. 


Orientation 


5. 


Bathing 


11. 


Communication 


6. 


Eating/Feeding 


12. 


Speech Impairment 


Subset IV (Cluster Analysis 


Two) 


1. 


Transferring 


7. 


Toileting 


2. 


Walking 


8. 


Bowel Function 


3. 


Wheeling 


9. 


Bladder Function 


4. 


Bathing 


10. 


Orientation 


5. 


Dressing 


11. 


Communication 


6. 


Eating/Feeding 


12. 


Mental Illness 



Subset V (Backward Elimination, a=.10) 



1. 


Mobility Level 


7. 


Toileting 


2. 


Walking 


8. 


Behavior (Type) 


3. 


Stair Climbing 


9. 


Cigarettes 


4. 


Bathing 


10. 


Anemia 


5. 


Dressing 


11. 


Mental Illness 


6. 


Eating/Feeding 


12. 


Chronic Respiratory Disease 


ibset VI (Backward 


Elimination, a=.05) 


1. 


Mobility Level 


6. 


Behavior (Type) 


2. 


Stair Climbing 


7. 


Cigarettes 


3. 


Bathing 


8. 


Mental Illness 


4. 


Eating/Feeding 


9. 


Chronic Respiratory 


5. 


Toileting 




Disease 
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application of Parker and Boyd's hierarchical centroid 
clustering algorithm to combined samples of Intermedi- 
ate and Skilled care patients, the variables being 
chosen by the B/T ratio criterion. The final two sub- 
sets were formed after application of a t-test of sig- 
nificance of the estimated regression coefficients in 
the 37-variable problem, one being taken atc^^.lO, the 



other ato( = .05. The appropriate test statistic is: 



t = 



n - p° 



s . 

1 



(IV. 63) 



for which we test 



VP? - 0 

<Vf»° * 0 



Using the two-tailed t-test, under H^, the t-statistic 
values are given as p J/s^ in Table IV. 3a. Notice that 
all but the final subset contain 12 variables, a fact 
which was arrived at both by chance and design. The de- 
cision was made based on the fact that each subset 
originally contained about 12 variables, and the reduc- 
tion in data collection effort by two-thirds obtained by 
choosing 12 variables seemed reasonable. The 9-variable 
subset represents an attempt to further reduce the num- 
ber of variables based on the encouraging result obtained 
by use of subset V. 
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For the sake of compactness, the results of 

fitting the model for each of the subsets are given in 

Table IV. 5. Results of each ANOVA are summarized in 

Table IV. 6. The ANOVA for each of the subsets shows 

evidence of overall regression to be strong, with 

p <”.001 in each case. The ratio SSR/SST provides a 

2 

gross indication of an upper bound for R since, in 
general , 

- SSR - (Correction for Mean) 

R = (IV. 64) 

SST - (Correction for Mean) 

and since SSR «£ SST, then 

0 2 SSR v „2 , TT7 , c , 

R u = SST^ R ' (IV. 65) 

2 

If we were to choose R y as the criterion for deciding 

which subset was best, subsets I, V, and VI might be 

worthy of further consideration, although all six yield 

2 

approximately equal R . Such a measure is not, however, 
in keeping with the spirit of the analysis; that is, 
derivation of a method which has reasonably high predic- 
tive validity for classification purposes. We therefore 
present the classification matrices as Tables IV. 7 
through IV. 12 to assist us in reaching a decision. To 
facilitate the decision process, the following measures 
of performance may be abstracted from the classification 
matrices for presentation in Table IV. 13: 
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Table IV. 5 

Estimated 6. from Logistic Model for the Six Subsets of Table IV. 4 



SUBSETS 



Variable 


I 


II 


III 


IV 


V 


VI 


Mobility Level 


-0.372 


-0.404 


-0.360 


— 


-0.294 


-0.301 


Transferring 


-- 




-0.224 


-0.304 


-- 


-- 


Walking 


-- 


-- 


0.051 


-0.006 


0.126 


-- 


Wheeling 


0.071 


0.105 


0.109 


0.098 


— 


-- 


Stair Climbing 




-- 


-- 


-- 


-0.433 


-0.406 


Bathing 


-0.664 


-0.630 


-0.672 


-0.707 


-0.591 


-0.638 


Dressing 


-0.493 


-0.357 




-0.427 


-0.368 


-- 


Eating/Feeding 




-0.494 


-0.534 


-0.488 


-0.557 


-0.665 


Toileting 


0.357 


0.491 


0.362 


0.432 


0.428 


0.346 


Bowel Function 




-0.116 


-0.083 


-0.097 


-- 


-- 


Bladder Fen. 






0.021 


0.028 




-- 


Orientation 


-0.275 




-0.138 


-0.071 


-- 


-- 


Communication 


-0.285 


-0.287 


-0.315 


-0.286 






Behavior (Type) 


0.204 


-- 


-- 




0.283 


0.272 


Speech Impair. 


-- 


— 


-0.008 


-- 


-- 


-- 


Fractures 


-0.211 


-- 


-- 






-- 


Paralysis 


-0.094 


-- 




-- 


-- 


-- 


Dentition 


-0.202 






-- 




-- 


Cigarettes 


-- 




-- 


— 


0.644 


0.698 


Anemia 




0.560 


-- 


-- 


0.628 




Mental Illness 




-1.376 


-- 


-1.000 


-1.812 


-1.821 


Neurol. Dis. 


-0.131 


-0.243 










Chron. Res. Dis. 




-0.868 






-1.007 


-1.017 




1.563 


1.191 


0.932 


0.829 


1.817 


1.801 


a 2 


4.008 


3.706 


3.354 


3.222 


4.506 


4.431 



Table IV. 6 

ANOVA Summary for the Six Subsets of Table IV. 4 



SUBSETS 



Quantity 


I 


II 


III 


IV 


V 


VI 


MSR 


246.86 


205.98 


216.79 


217.67 


262.33 


353.57 


MSE 


2.06 


2.52 


2.29 


2.18 


2.32 


2.24 


F 


119.75 


81.64 


94.71 


100.02 


112.91 


157.97 


df 


11,294 


11,294 


11,294 


11,294 


11,294 


8,297 


SSR 


2715.5 


2265.76 


2384.71 


2394.42 


2885.64 


2828.54 


SST 


3321.6 


3007.51 


3057.71 


3034.28 


3568.69 


3493.31 


SSR/SST 


.83 


.75 


.78 


.79 


.81 


.81 
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1. Overall prediction rate, 

2. Overall recognition rate, 

3. Percentage of misses > 1 step off main 
diagonal . 

With the information made available by these 
tables, certain conclusions may be reached concerning a 
choice of subset. On all but the third of the perform- 
ance measures proposed, the fifth subset is shown superior 
to all other subsets examined as well as the original 
37-variable problem. This last point tends to lend 
credence to the hypothesis of negative interaction among 
some of the variables which appears as error due to the 
specification of non-interaction in the original model. 

The non-interactive logistic function model is probably 
a better representation of reality for subsets V and VI. 
Subset VI performed about as well as the original model 
in terms of the first two measures. The original model 
does appear to give a higher percentage of "near misses" 
than the reduced forms, but certainly not so significant 
an amount as to outweigh its additional workload. Final- 
ly, it can be observed that all seven attempts at fitting 
the model tended to give a slight preponderance of misses 
wherein a patient was deemed to be properly placed in a 
higher level of care than prior judgment indicated. This 
can be seen by examining the totals for the assigned 



Classification Matrix for Subset I 



Assigned Level of Care 





Int. B 


Int. A 


SNF 


Int. B 


33 


26 


7 


Appropriate Int. A 

Level of 

Care SNF 


14 


63 


33 


4 


31 


95 


Total 


51 


120 


135 


Prediction Rate 


.647 


.525' 


.703 



Total 


Recognition Rate 


66 


.500 


110 


.573 


130 


.730 



Table IV. 8 



Classification Matrix for Subset II 



Assigned Level of Care 





Int-. B 


Int. A 


SNF 


Int. B 


41 


19 


6 


Appropriate Int. A 
Level of 

Care SNF 


12 


67 


31 


0 


32 


98 


Total 


53 


118 


135 


Prediction Rate 


.774 


.568 


.726 



Total 


Recognition Rate 


66 


.621 


110 


.609 


130 


.754 
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Table IV. 9 

Classification Matrix for Subset III 
Assigned Level of Care 



Int. B 

Appropriate Int. A 
Leyel of 

Care SNF 

Total 

Prediction Rate 



Int . B 


Int. A 


SNF 


33 


27 


6 


9 


67 


34 


1 


31 


98 


43 


125 


138 


.767 


.536 


.710 



Total 

66 

110 

130 



Recognition Rate 
.500 
.609 
.754 



Table IV. 10 

Classification Matrix for Subset IV 
Assigned Level of Care 



Level of 
Care 





Int. B 


Int. A 


SNF 


Int. B 


34 


25 


7 


Int. A 


11 


63 


36 


SNF 


3 


30 


97 


Total 


48 


118 


140 


i Rate 


.708 


.534 


.693 



Total 


Recognition Rate 


66 


.515 


110 


.573 


130 


.746 
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Table IV. 11 

Classification Matrix for Subset V 
Assigned Level of Care 



Level of 
Care 





Int. B 


Int. A 


SNF 


Int. B 


44 


17 


5 


Int. A 


9 


67 


34 


SNF 


2 


23 


105 


Total 


55 


107 


144 


i Rate 


.800 


.626 


.729 



Total 

66 

110 

130 



Recognition 

.667 

.609 

.808 



Table IV. 12 



Classification Matrix for Subset VI 
Assigned Level of Care 
Int. B Int. A SNF Total 

66 
110 
130 



Total 


.54 


110 


142 


Prediction Rate 


.796 


.600 


.711 



Int. B 

Appropriate Int. A 
Level of 

Care SNF 



43 


17 


6 


9 


66 


35 


2 


27 


101 



Recognition 

.652 

.600 

.777 



Table IV. 13 

Summary of Performance Measures for the Six Subsets 



Measure I II III IV V 



CD 


.624 


.689 


.671 


.645 


.718 


( 2 ) 


.598 


.661 


.621 


.611 


.695 


( 3 ) 


9.6 


6.0 


6.5 


8.9 


7.8 



Rate 



Rate 



VI 



.702 

.676 

8.3 
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levels of care as compared with the totals for the 
appropriate levels of care. The net effect of this 
characteristic of the system is to make the classifica- 
tion a bit conservative when taken over the patient 
population of a facility. But since the ultimate use of 
this scheme is in predicting nursing demands, the nature 
of the errors made could lead to slight over-staffing, a 
situation perhaps more tolerable from the patient view- 
point than the reverse. This implies, of course, that 
the assumed cost of misclassif ication is equal whether 
the misclassif ication is to a higher or lower level of 
care . 

It is worthy of mention that since further dis- 
cussions with CPAI research personnel indicated the con- 
sideration of factors other than health status in adjudi- 
cating appropriate level of care (e.g., insurance 
arrangements, relatives at home, etc.), some attempt was 
made to analyze the records of those patients who missed 
on several of the runs. Only a general impression that 
the health status might be closer to the assigned versus 
appropriate level of care estimates could be obtained, 
however, due to unavailability of professional persons 
to reassess records in light of the current study goals. 

We conclude, therefore, by noting that subsets V 
and VI are the groupings of choice for consideration in 
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an operational patient classification system. Recall 
that no claim is made that these are the optimal subsets 
of size 12 and 9, respectively, in terms of any cri- 
terion; rather, they represent reasonable approximations 
due to an apparent insensitivity of the model among both 
intuitively and statistically derived subsets. What now 
remains is the translation of the fruits of the analysis 
to an instrument engineered for use by professional 
nurses in a typical long-term care facility. 

•Factors to be considered in the design of the 
patient classification form include the assurance that 
the same assessment scales will be used as in the CPAI 
data, that easily followed directions for use will be 
available, that a clear format be developed / and that a 
simple way to reproduce the pattern recognition function 
served by the logistic model be provided. The first of 
these items may be provided by including a table of 
assessment scores with the form, these being abstracted 
from the CPAI User's Manual [35]. The next two are im- 
plementation problems for which we propose a possible 
example. The last item is dealt with by use of easily 
followed formulae requiring addition, subtraction, multi- 
plication and three table look-ups in a table provided. 

In this last regard, the sensitivity of the model to a 
rounding of its estimated coefficients to one decimal 
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place was tested for subsets V and VI with a degradation 
of no more than 2% being observed in either the predic- 
tion or recognition rates. Hence, this more easily used 
format is recommended. 

Figure IV. 7 below illustrates the proposed 
patient classification form based on subset V. If it is 
desired to use subset VI instead, appropriate substitu- 
tions may be made using the estimates provided in Table 
IV. 5. ‘Note that the entire range of possible values of 
the logistic function exponent is represented in the 
"table of values." 

As a final note, the classification form of 
Figure IV. 7 may be further modified, if space permits, 
to include the display of assessment and classification 
over time by duplicating the fourth and fifth columns 
with an appropriate notation for the date. 

IV. 7 Final Remarks 

The analysis of this chapter has led us through 
the general pattern recognition problem to a specific 
model from which the essentials of the recognition prob- 
lem could be drawn, and thence to subset selection and 
the presentation of a proposed patient classification 
instrument based on medical and functioning status con- 
siderations only. The aggregate numbers of patients 
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Figure IV. 7 

Patient Classification Form 

Name I.D.# Room 

Directions ; Assess the patient with respect to the items below in 

accordance with the attached rating scale. Multiply the 
score obtained by the factor indicated and sum the results. 
Find Total 1 and Total 2 as shown, then consult the 
enclosed table to obtain values for Class B, Class A, and 
Class S. The,, maximum of these three numbers yields the 
Estimated Classification (Int. B, Int. A, SNF) 



Item # Description Factor Assessment Factor x Assessment 



1 


Mobility 


-0.3 






2 


Walking 


0.1 






3 


Stair Climbing 


-0.4 






4 


Bathing 


-0.6 






5 


Dressing 


-0.4 






6 


Eating/Feeding 


-0.6 






7 


Toileting 


0.4 






8 


Behavior (Type) 


0.3 






9 


Cigarettes 


0.6 






10 


Anemia 


0.6 






11 


Mental Illness 


-1.8 






12 


Chron. Res .Dis . 


-1.0 







Total = 



Total 1 = Total + 1.8 = 
Total 2 = Total + 4.5 = 
Class B = Value of Total 1 = 

Class A = Value of Total 2 - Value of Total 1 = 

Class S = 1 - Value of Total 2 = 



Class 
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Figure IV. 7, continued 



Assessment Rating Scales 



(Note: HA means Human Assistance, MA means Mechanical Assistance) 
[from: Patient Classification for Long-Term Care: User's Manual] 



Item Description 



Score Interpretation 



1 Mobility Level 



0 Gets outside without either MA or HA 

1 Gets outside with MA but without HA 

2 Gets outside with HA, with or without MA 

3 Confined to institution, moves about 
without HA or MA 

4 Confined to institution and moves about 
with MA but without HA 

5 Confined to institution, moves about 
with HA, with or without MA 

6 Confined to bed or chair 



2 Walking 



3 Stair Climbing 



4 Bathing 



5 Dressing 



6 Eating/Feeding 



0 Walks without MA or HA 

1 Walks with MA but without HA 

2 Walks with HA but withoug MA 

3 Walks with HA and MA 

4 Does not walk (bed or chair) 

5 Does not walk (bedfast) 

0 Goes up and down flight of stairs without 
HA or MA 

1 Goes with MA but not HA 

2 Goes with HA but not MA 

3 Goes with MA and HA 

4 Does not perform 

0 Bathes self without assistance 

1 Bathes self with MA only 

2 Bathes self with HA, with or without MA 

3 Does not bathe self 

0 Dresses self without assistance 

1 Dresses self with MA only 

2 Dresses self with HA, with or without MA 

3 Does not dress self 

0 Feeds self without assistance 

1 Feeds self with MA only 

2 Feeds self with HA, with or without MA 

3 Does not feed self, is fed 

4 Does not feed self, tube fed 

5 Does not feed self; parenteral administra- 
tion of fluids 
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Figure IV. 7, continued 
Assessment Rating Scales, Continued 
Item Description Score Interpretation 



7 


Toileting 


0 


Uses toilet room without assistance 






1 


Uses toilet room with MA only 






2 


Uses toilet room with HA, with or without MA 






3 


Does not use toilet room 


8 


Behavior (Type) 


0 


Appropriate 






1 


Inappropriate, wandering or passive 






2 


Inappropriate, abusive or aggresive 






3 


Inappropriate, other 


9 


Cigarette Smoking 


0 


Not current smoking 






1 


Smoking less than 15 cigarettes/day 






2 


15-24 cigarettes/day 






3 


25 or more/day 


10 


Anemia 


0 


Not present 






1 


Present (Aplastic, pernicious, folic 
acid, sickle cell, or combinations) 


11 


Mental Illness 


0 


Not present 






1 


Present 


12 


Chronic Respira- 


0 


Not present 




tory Disease 


1 


Present (Asthma, chronic bronchitis, 
emphysema, removal of one lung (pneumonec- 
tomy) ) 



Figure IV. 7 continued 
Table of Values 
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Total 1 or Total 1 or Total 1 or 

Total 2 Value Total 2 Value Total 2 Value 



-5.0 


0.007 


-0.9 


-4.9 


0.007 


-0.8 


-4.8 


0.008 


-0.7 


-4.7 


0.009 


-0.6 


-4.6 


0.010 


-0.5 


-4.5 


0.011 


-0.4 


-4.4 


0.012 


-0.3 


-4.3 


0.013 


-0.2 


-4.2 


0.015 


-0.1 


-4.1 


0.016 


0.0 


-4.0 


0.018 


0.1 


-3.9 


0.020 


0.2 


-3.8 


0.022 


0.3 


-3.7 


0.024 


0.4 


-3.6 


0.027 


0.5 


-3.5 


0.029 


0.6 


-3.4 


0.032 


0.7 


-3.3 


0.036 


0.8 


-3.2 


0.039 


0.9 


-3.1 


0.043 


1.0 


-3.0 


0.047 


1.1 


-2.9 


0.052 


1.2 


-2.8 


0.057 


1.3 


-2.7 


0.063 


1.4 


-2.6 


0.069 


1.5 


-2.5 


0.076 


1.6 


-2.4 


0.083 


1.7 


-2.3 


0.091 


1.8 


-2.2 


0.100 


1.9 


-2.1 


0.109 


2.0 


-2.0 


0.119 


2.1 


-1.9 


0. 130 


2.2 


-1.8 


0.142 


2.3 


-1.7 


0.154 


2.4 


-1.6 


0.168 


2.5 


-1.5 


0.182 


2.6 


-1.4 


0.198 


2.7 


-1.3 


0.214 


2.8 


-1.2 


0.231 


2.9 


-1.1 


0.250 




-1.0 


0.269 





0.289 


3.0 


0.953 


0.310 


3.1 


0".957 


0.332 


3.2 


0.961 


0.354 


3.3 


0.964 


0.378 


3.4 


0.968 


0.401 


3.5 


0.971 


0.426 


3.6 


0.973 


0.450 


3.7 


0.976 


0.475 


3.8 


0.978 


0.500 


3.9 


0.980 


0.525 


4.0 


0.982 


0.550 


4.1 


0.984 


0.574 


4.2 


0.985 


0.599 


4.3 


0.987 


0.622 


4.4 


0.988 


0.646 


4.5 


0.989 


0.668 


4.6 


0.990 


0.690 


4.7 


0.991 


0.711 


4.8 


0.992 


0.731 


4.9 


0.993 


0.750 


5.0 


0.993 



0.769 

0.786 

0.802 

0.818 

0.832 

0.856 

0.858 

0.870 

0.881 

0.891 

0.900 

0.909 

0.917 

0.924 

0.931 

0.937 

0.943 

0.948 
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classified into the three levels of care thus generate 
demands for nursing services as approximated by the 
tri-level time estimates derived from McKnight ' s Col- 
orado Study [41] . Allocation and assignment of resources 
to meet these demands is the subject of the models pro- 
posed in Chapter III. 

The factors identified by the methodology used 
here seem to be in agreement with the findings of other 
researchers as reported in Chapter II with those few 
differences that exist being noted. In general, the com- 
mon items tend to be of the ADL (Activities of Daily 
Living) and psycho-social variety, with the various 
identified medically defined conditions seeming to depend 
on the sample used. In terms of comparison of the pre- 
dictive and recognitive powers of the system used here 
with previous work, only the results of Parker and Boyd 
[53] might provide a rough benchmark. With the total 
sample of 623 patients scored on the CPAI and all 60 
variables in use, a comparison of their discriminant 
analysis findings for what we have termed Int . B, Int. A, 
and SNF patients reveals an overall prediction rate of 
.73 and recognition rate of .79 with about 18% of misses 
on these groups coming at more than one step off the main 
diagonal of the classification matrix. Note, however, 
that with a total of 6 levels being fitted, it rightly 
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could be said that their model had more opportunities to 
miss than ours, which is "locked in" for the middle 
category. Nevertheless, the 12- and 9-variable problems 
give a reasonably close result to that reported by Parker 
and Boyd so as to be encouraging. 

It should be emphasized that the patient classi- 
fication system proposed here can only be as accurate a 
reflection of patient health status as allowed by the 
data used in its derivation. Therefore, it is recom- 
mended that replication of the study be undertaken, uti- 
lizing a similar methodological approach with additional 
data as they become available. 

Our purpose has been to demonstrate a reasonably 
simple methodology and approach to the derivation of 
nursing service demand data for use in allocation models. 
In addition, however, the classification scheme can be 
used to assist the appropriate agencies in patient level 
of care placement. That is, the results of application 
of the instrument can serve to "flag" patients who defi- 
nitely require certain levels of nursing care. This 
information, in turn, can be combined with other inputs 
to arrive at an acceptable decision. 

The necessity for additional research in the 
area of subset selection for multivariate methods empha- 
sized by Fisher, et al. [23] is reiterated here because 
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of its obvious importance in both studies of this 
nature and a variety of other applications. 



Chapter V: Mathematical Programming Solution 

Methodology: Parametric Mixed-Integer 

Linear Programming 

V . 1 Introduction 

In Chapter III we noted that the examination of 
solutions of the Basic Staffing Model with alterations 
in the parameters provides valuable information for both 
the facility administrator and the director of nursing. 
Specifically, we saw that by extending the Basic Staff- 
ing Model to the forms of Model I and Model II we could 
interpret the results obtained by parametric changes in 
the two models to yield insights into the following four 
aspects of the administrator's problem: 

1. Alterations in the staff mix as a result of 
changing service levels; 

2. Alterations in the staff mix as a result of 
changing budget limits; 

3. Changes in legal staffing requirements; 

4. Alterations in the priority-related limits 
on patient demands. 

The Basic Staffing Model, 
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MAX 



s . t . 
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Is.n. ^ Budget 
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££ x. - a.n. 0 , 
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5 X ijk f jk 



= 0 



L . n . ^ U . and integer 

l—i—i 



L . 



jk — ? jk — U jk 



x . > 0 , 

ljk — ' 



V i 

* j,k 

V i 

V j ,k 

^ i/ j fk , 



(V.l) 



was therefore modified in such a way that the first 
and second of the four aspects above were subsumed in 
Model I, while Model II was designed to relate to the 
remaining considerations. We restate these two models 
below for purposes of completeness as (V.2) and (V.3). 






s.t. £ s.n.^. Budget 
i 



(Model I) 



££, x . . . - a.n. 

jk 1 1 - 


0 , 


^i 


£ x . ., - ft .. =0 

f iD k r D k 

Sfr* - s 


/ 


V j ,k 
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max £ ££ c . . . x . + ryp a. 



jk 



s.t. E s.n.^- Budget 
i 11 



iZx.., - a.n. *0, 
jk « k 1 1 - 

(Model II) 

1 x ijk‘ P jk = 0 ' * j ' k 



( V . 3 ) 



“Pjkf S 

3 k J 

L . +0 (L^-L . )^n . <:U . +«t(U^-U . ) and integer, V i 
i li — i—i i i ^ 

L jk +8(L jk- L jk ) iPjki u jk +ok(u jk- u jk ) ' i' k 



x ijk >°, 

, e € [0,1] 



NJ i, j ,k 



We have previously indicated that solutions to 
each model with alterations in .the service level (S) 
the budget, and the upper and lower bounds via («< ,9) , 
respectively, could be accomplished by resolving each 
altered problem from scratch. In this chapter, however, 
we will propose two algorithms, one for each model form, 
which will allow for the efficient examination of a 
number of alternative values of S and the budget in 
Model I and the pair (®< ,0) in Model II. This will be 
accomplished by incorporation of the concepts of para- 
metric programming into an enumerative mixed-integer 
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linear programming algorithm. 

In general, parametric linear programming deals 
with the study of the optimal solutions to a family of 
mathematical programs related by a parametric representa- 
tion of one or more model data components. Included 
within the general heading of parametric programming are 
the class of techniques known as "postoptimality analy- 
sis." The main feature of these techniques is that the 
examination of the objective function as a function of 
the parameters takes place after some previous problem 
has been solved to optimality with a specific set of 
parameters. In this way, a minimum of effort relative 
to complete problem resolution need be expended in find- 
ing the new optima. The algorithms to be proposed in 
this chapter will gain their efficiency by incorporation 
of the ideas of postoptimality analysis. 

For review purposes, the basic ideas of para- 
metric linear programming will first be discussed, fol- 
lowed by an examination of mixed-integer linear program- 
ming (MILP) solution techniques. In conjunction with 
this latter topic, the literature survey of the para- 
metric MILP problems will be presented. Then a basic 
new parametric MILP algorithm will be given, and its 
applicability to Models I and II will be discussed. 
Finally, algorithms designed to solve the generic class 
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of problems represented by the structures of the two 
models will be stated, followed by the presentation of 
computational results for randomly generated example 
problems. We shall postpone to Chapter VI an example 
of the use of both algorithms in connection with the 
solution of the facility administrator's problem. 

V.2 Parametric Linear Programming 

The parametric MILP algorithm proposed is based 
on the MILP solution technique known as branch and bound 
(cf. Garfinkel and Neuhauser [25]). Branch and bound 
uses linear programming quite extensively, and since we 
will incorporate the parametric aspects of the new algo- 
rithm in the LP routine, we briefly review some of the 
major concepts of parametric linear programming. For the 
purposes of the exposition that follows, note that in 
general the upper and lower bounds on program variables 
could be explicitly included as model constraints. In 
this manner, the parametric representation of Model I and 
Model II can both be subsumed in the parametrization of 
the right-hand-sides (requirements vector) . 

After addition of the appropriate slack and sur- 
plus variables we may write the general linear program 
(LP) as 
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MAX cx 

(LP) s.t. A x = b ( V . 4 ) 

x > 0 

where the constraint matrix A is (m x n) , the objective 
coefficients vector c is (1 x n) , the vector of vari- 
ables x is (n x 1) , and the requirements vector b is 
(m x 1) . For any set of basic variables x fi we have 

x B = B -1 b - B _1 N_x n >0 (V . 5) 

where B ^ is the inverse of the basis matrix and is 
(m x m) , N is the matrix of non-basic columns of A and 
is (m x n-m) , and x^, the vector of non-basic variables, 
is (n-m x 1) . If B is an optimal basis, then = 0_. 

Suppose now that a change in the vector b is 
under consideration. Such a change may include altera- 
tions in the entire vector or a subset of its elements. 
In general, any alteration in the parameters of (LP) can 
affect either the optimality of a given solution or its 
feasibility. The optimality and feasibility criteria 
are : 

(optimality) z^-c^=c_^B ^a^-c^O, j=l,2,...,n (V.6) 

(feasibility) x B =B — — (V.7) 

Obviously a change in b has no effect in (V.6) , but can 
possibly affect (V.7) . As demonstrated by Hadley [28] , 
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if we write the altered requirements vector, b* as 

b* = b + Or (V . 8 ) 

where for convenience we choose 0fi[O,l] and r to yield 
the desired range of consideration in the elements of b 
under study, then (V.7) becomes 

x£ = B _1 b* = B _1 (b+0r) (V.9) 

= B _1 b + 0B _1 r > C_ 

Thus, x£ will be non-negative for certain values of 0. 

In fact, if B "*"r is non-negative, then the feasibility 
condition will be met for any 0€[O,1]. Writing the 
vector B ^r as y, suppose that some elements of y are 
negative. Then based on (V.9) we can find that value of 
0, called 0 , such that some x * 4 0 for © > 0 , where 

C B ^ C 

, y i 4 0 . (v.10) 

Then for 0>© c , the optimality condition is satisfied, 
but the feasibility condition is violated. For any 
specific value of 0 > , the optimal solution can be 
found (if one exists) using the dual simplex algorithm, 
starting from the optimal basis obtained earlier. Thus, 
if only certain values of 0 ; say 0^ , • • • • / are °f 

interest, the feasibility condition (V.9) can be checked 
for these values. If the feasibility condition is vio- 
lated, we use the previous basis to reoptimize as 



0 = mm 
c 

l 



’ X B. 
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before . 

An interesting property of the type of post- 
optimality analysis under consideration here is given 
without proof in Theorem 1, first postulated by 
Dantzig [21] . 

Theorem 1 : For the problem (LP) , the objective 

function Z = MAX c x is a piecewise- 
linear, concave function of the 
requirements vector b. 

We observe that such a strong result cannot be obtained 
in the case of the MILP (for further analysis of MILP 
continuity properties, see Radke [55]). 

A further consideration concerns the practicality 
of simultaneous parameterization of subsets of the 
requirements vector in terms of two or more parameters. 
For example, consider the problem posed below: 

MAX c x 



s . t . A.x, = 


b. 


+ 0 r, 




1—1 


-1 


-1 




A 2— 2 


—2 


+ A r 2 


(V.ll) 


x > 0 









0, A €[0,1] 
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The feasibility condition (V.9) requires that 

> 0 . (V . 12 ) 

Partitioning B ^ to be compatible with the row partition 
of the parametric alterations vector as 

B _1 = [D^ D 2 ] , (V . 13 ) 

we may write (V.12) as 

B -1 b + GD.^ +AD 2 £ 2 > 0 . (V . 14 ) 

The obvious difficulty is that each one of the above 
equations in the two unknowns 0 and A makes the deriva- 
tion of a condition analogous to (V.10) difficult. This 
result has implications for our study of Model II, where 
simultaneous parameterization of the upper and lower 
bounds is proposed. The combinatorial nature of such 
parameterizations may be resolved by fixing one parameter 
a priori thus reducing (V.14) to the single parameter 
case as in (V.9). 

In this section we have briefly outlined some 
important results of parametric linear programming. 

These results, with modifications, will form the basis 
for the parametric branch and bound algorithms to follow. 
Note that due to the nature of Models I and II we have 
only considered alterations in the requirements vector b. 



-l" 


+ B 1 




—2 




*e 2 _ 
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There exist equally well-developed methodologies for 
dealing with the parameterization of the objective func- 
tion coefficients vector c and the elements of the con- 
straint matrix A. For a detailed explanation of these 
results the reader may consult Hadley [28] . 

V.3 Parametric Integer and Mixed Integer Linear Program- 
ming 

When all or a subset of the variables of a linear 
program are constrained to be integers, there is a con- 
siderable increase in difficulty in solving the problem. 
It is well known that this is essentially due to the loss 
of the properties of the LP feasible region upon which 
the simplex algorithm is based. That is, there is no 
longer any guarantee that the optimal solution will lie 
at an extreme point of the feasible region. Obviously, 
therefore, the postoptimality analysis results applicable 
in the LP case cannot be extended to the case of the 

integer linear program (ILP) or MILP. Under some very 
special circumstances, however, the application of the 
simplex algorithm and subsequent LP postoptimality analy- 
sis will be appropriate. If the sufficient condition of 
total unimodularity (cf. Garfinkel and Neuhauser [25], of 
the ILP or MILP constraint matrix A is met, LP results 
may be applied to yield feasible solutions. Then, too. 
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it may be the case that A possesses some other special 
structure which allows the use of LP. 

Because of the general inappropriateness of the 
simplex method for use in the MILP, various other solu- 
tion techniques have been developed (cf. [25], [60]). 
These techniques essentially fall into three major 
classes : 

1. Cutting planes; 

2. Benders partitioning; 

3. Branch and bound. 

Based on the theory of each of these methodologies, 
several authors ([26], [1], [6], [7], [59], [49]) have 

proposed extensions to deal with the questions of post- 
optimality analysis with varying degrees of success. We 
shall therefore briefly review each solution technique, 
indicating also the subsequent efforts in the realm of 
postoptimality analysis. Note that the efforts to be dis 
cussed in connection with cutting planes and Benders par- 
titioning are concerned with closing the "duality gap" 
and the concept of "sensitivity analysis," both to be 
defined below. While not directly applicable to our work 
the review of such efforts serves not only to give his- 
torical perspective, but also to indicate how postopti- 
mality analysis might be carried out for small changes 



in the requirements vector. We therefore consider the 
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techniques of sensitivity analysis ss a subclass of the 
techniques of postoptimality analysis. 

V.3.1 Cutting Planes 

The cutting plane approach, primarily due to 
Gomory, is based on the philosophy of adding additional 
constraints, called "cuts," on the feasible region of the 
LP relaxation of the MILP. The cut constraint formation 
is based on information contained in any row of the sim- 
plex tableau (called the "source row") and such constraints 
are appended to the tableau. In general, the additional 
constraint will cause primal infeasibility, necessitating 
the application of the dual simplex algorithm. It is 
guaranteed that no mixed-integer feasible solution will 
be excluded by application of the cuts. For further de- 
tails on the specific methodology, the reader is referred 
to Garfinkel and Nemhauser [25] or Salkin [60] . The 
mathematics of cutting plane algorithms are quite elegant, 
leading one to believe the methods might be computation- 
ally attractive. This assumption has not been borne out 
in practice, however, where it is often the case that the 
finite number of steps needed for algorithmic termination 
are, in fact, unachievable in a practical sense. 

Because the cutting plane method is based on the 
solution of the LP relaxation of the MILP, with the 
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constraint set expanded by the addition of cuts, it is 
natural to conjecture that the resulting optimal LP 
might be useful in performing postoptimality analysis. 

One such attempt was made by Gomory and Baumol [26] for 
the ILP , but the analysis was somewhat different from 
what we have to this point called postoptimality analysis. 

The Gomory and Baumol approach is based on examination of 
a "sensitivity analysis" of the final simplex tableau at 
termination of the cutting plane algorithm; that is, how 
sensitive is the optimal objective function value to 
marginal changes in the requirements vector. Such informa- 
tion may be obtained from an examination of the dual vari- 
ables, available as the non-basic slack variables of the 
primal LP simplex tableau or from solution of the dual 
linear program. A well-known result with respect to the 
primal and dual LPs is given without proof in Theorem 2. 

Theorem 2 ; If the primal and dual linear pro- 
grams possess feasible solutions x 
and y respectively, then x and y are 
also optimal to their respective problems, 
and further, 

cx = yb (V. 15) 

where yb is the dual objective function. 

In the case of the MILP, Theorem 2 has no direct inter- 
pretation since the dual for the MILP is not defined. 
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Nevertheless, in the solution of the LP relaxation of the 
MILP,(V.15) is true. In general, however, 

££mILP — — LP (V * 16) 

Thus the difference between the objective function values 

is called the "duality gap." If such a gap exists, the 
results of LP sensitivity analysis cannot be applied. 

The Gomory and Baumol approach is to use the dual varia- 
bles associated with the cut constraints to recompute 
the dual variables associated with the original constraints. 

By "imputing back" the effect ofthe dual vari- 
ables obtained using the cuts to the duals associated 
with the original constraints, the relation (V.15) was 
shown to be true for active (or tight) original con- 
straints only. Although in the usual LP problem inactive 
constraints have a zero dual variable associated with 
them, this property may or may not be true as a result of 
recomputation. Further, the recomputed duals are non- 
unique, depending upon the order of cut generation. Thus, 
the Gomory and Baumol approach does not completely solve 
the sensitivity analysis question. 

Some years later, Alcaly and Klevorick [1] , 
recognizing the shortcomings of the Gomory and Baumol 
proposal, asserted a system of "generalized dual prices" 
which remedied the problems with inactive constraints. 
Nevertheless, the duality gap was shown to exist even 
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under this new methodology, leaving ILP sensitivity 
analysis an open question. 

V.3.2 Benders Partitioning 

The Benders partitioning method of MILP solution 
relies on the use of information obtained by iteratively 
solving two problems partitioned from the original MILP. 
Following the exposition given in Garfinkel and Nemhauser 
[25] , we write the initial MILP as 
MAX c^x + c_ 2 y 

s.t. A^x + A y £ b (V.17) 

x > 0 and integer 

y > o 

The Benders LP problem is 

MAX c^x + min u(b - A^x) 
u 

x > 0^ (V. 18) 

s.t. u A 2 > c 2 

u > 0 , 

and is obtained by taking the dual of (V.17) with x 
fixed at some value, thus reducing the MILP to an LP . 

The second Benders problem, an ILP, is given as 
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MAX u 



o 



S . t . U c. 

o 1 



x + u t (b-A^x) , V u^" ft T 



(V . 19 ) 



9 . — (b-A^x) 




x > 0 and integer 



1 1 t 

where T = (u lu is an extreme point of the constraint 
set of (V.18)) and Q = (y^ u*' + 0y^, 0 > O^is an extreme 
ray for some u t € T) . Benders' algorithm calls for the 



iterative application of (V.19) to determine an x to be 
used in (V.18), from which either an optimal solution is 
obtained or the sets T and/or Q are altered. Problem 
(V.19) is again solved, and so on. The algorithm can be 
shown to be finite. 

Based on the concepts of the partitioning method , 
Balas [7,8] proposed a constructive approach for solving 
the duality gap problem, thereby allowing resolution of 
the MILP sensitivity analysis question. As the initial 
step, a dual problem to (V.17) is defined as follows: 

MAX MIN c^x + u(b-Aj x) 
x u 



s.t. 



H A 2 - v = c 2 



(V . 20 ) 



u > 0 



x > 0 and integer 



v . unconstrained 
J 

Vj > 0, j*N 2 



jftN x 



/ 
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where is the index set of x and the index set of 
y. The solution of the MILP (V.20) yields v, from 
which a vector of "subsides," s_ may be derived. These 
subsidies, in turn, lead to the construction of the 
following special LP, formed from the relaxation of 
(V . 17 ) : 

MAX (Cj+s) x + c^y 

s.t. A ± x + A 2 y £ b ( V . 2 1 ) 

x, y > 0_ , 

Balas demonstrates that 

(c i+s ) x + c^y = , (V.22) 

and thus the duality gap is closed. This is because an 
optimal solution to (V.17) will also be optimal to 
(V.21), and the dual variables of (V.21) are the proper 
values to use in a sensitivity analysis of the original 
MILP. The connection with Benders partitioning method 
is clear when (V.20) is examined. Assuming we fix x to 
be "admissible," (V.20) is the dual LP analogous to 
(V.18). The appropriate analog for (V.19) may also be 
derived . 

In spite of the fact that the MILP sensitivity 
analysis question appears to be answered, this method 
nevertheless has its drawbacks. Of major concern is that 
solution of (V.21) will not, in general, yield an MILP 
feasible solution. The reorientation of the objective 
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function simply assures that the MILP solution may be an 
alternate optimum. Any change in b would be reflected 
in the solution for v, thereby changing s. Hence, even 
if we could guarantee finding the proper solution, (V.21) 
may not be helpful in solving the postoptimality analysis 
problem. Attempts in connection with our study to find 
an efficient scheme for updating v could not be found, 
any such alteration requiring a complete resolution of 
(V.20), which is tantamount to resolving the MILP from the 
outset with the parametric change incorporated. 

V.3.3 Branch and Bound 

The method of Branch and Bound is probably the 
most widely applied MILP solution technique. Recognizing 
the essentially combinatorial nature of the MILP, this 
method seeks to identify those solutions which are domi- 
nated by others and to remove them from further consider- 
ation. Dominance is inferred from upper and lower 
bounds on the objective function obtained via the solu- 
tion of LP relaxations. 

If solution of the relaxation of the original 
MILP does not yield an MILP feasible solution (if it does, 
we are done) , an integer-restricted variable in the 
optimal solution which did not terminate integer is 
chosen. Calling this variable x£, two new LPs are formed 
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with constraint sets 

R 1 = ( (X'Y) |a i x + A 2 y_ £ b; x,y_ > 0, x ± > [x£] + 1) 

R 2 = ^ (x>Y) | A i x + A oI - X'Y. 19.' x i ^ tx*] ) 

This partitioning insures that no feasible solutions will 
be excluded. The LP with either or is chosen for 
solution, and the method proceeds in this fashion. An 
upper bound Z^ at problem i is obtained from the feasi- 
ble LP objective value. A lower bound, Z^, can be found 
from a MILP feasible solution. The maximal such lower 
bound is called Z^. For any problem with constraint set 
R^, we may curtail further consideration of it and its 
successor problems if 

Z = Z , or (V . 2 3 ) 

Z. ^ Z (V . 24 ) 

l — — o 

The condition (V.23) implies that no better MILP feasible 
solution can be found, while (V.24) yields the conclusion 
that the best known MILP feasible solution is better 
(has higher objective function value) than any succeed- 
ing LP relaxation. Both conditions result from the fact 
that the successor to a given problem is more constrained 
than the problem itself. 

The choice of variables upon which to partition, 
the order of problem consideration, and the improvement 
of bounds are all subjects of algorithmic improvement and 
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will not be considered here. Again, the reader is 
referred to the texts mentioned earlier. 

The parametric methodologies we have discussed 
in connection with the cutting plane and Benders parti- 
tioning algorithms have been concerned with the question 
of obtaining sensitivity analysis results for the MILP 
by resorting to some modified LP form. Utilizing the 
branch and bound algorithm, however, two authors. Rood- 
man [59] and Nauss [49], have been able to provide in- 
sights into the more general question of MILP postopti- 
mality analysis. 

The method of Roodman [59] is essentially a modi- 
fication of the branch and bound algorithm designed to 
extract sufficient information such that 

1. The critical values for parametric repre- 
sentation of the requirements vector and/or 
the objective function coefficients may be 
determined; and 

2. Efficient problem resolution after a specific 
parametric alteration in the requirements 
vector and/or objective function coefficients 
is possible. 

A crucial factor in the application of the method is the 
initial statement of the MILP in the form of the most 
restrictive problem to be considered. That is, for the 
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problem (V.17) , c + and b + are taken to be the minimum 
possible values. In addition, the MILP must be trans- 
formed so that the integer-restricted variables are 
binary, and the continuous variables are defined on the 
interval [0,1] . 

Initially, the restricted MILP is solved to 
optimality using branch and bound. The reason for 
fathoming (deleting from further consideration a problem 
and its successors) each node is noted as being due to 
one of the following three conditions: 

1 . Bounds ; 

2. Integrality; 

3. Infeasibility. 

For conditions (1) and (2) , critical values of the para- 
meters involved in the objective function and require- 
ments vector such that the fathoming condition is just 
violated are determined. For condition (3) , a critical 
value for the requirements vector such that the problem 
would become feasible is found. All such information for 
every fathomed node must be stored. The minimum / taken 
over all nodes, of the first two of these values gives 
the critical values for the objective function and 
requirements vector parameters such that below these 
values the current optimal solution remains optimal . 

For a proposed parametric change, the appropriate 
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critical value is considered for each node (depending on 
the fathoming condition) to determine if it is violated. 
If so, these problems, with rhe change incorporated, 
must be resolved, and the branch and bound algorithm 
continued from these nodes. 

In retrospect, Roodman ' s method captures the 
essence of the MILP postoptimality analysis problem. 
Unfortunately, it has several limitations, among which 
are the following: 

1. The requirement for initial transformation 
of the MILP; 

2. Potentially large amounts of information to 
be stored (e.g., fathoming conditions, 
critical values, etc., for every node); 

3. LPs to be resolved must be started from 
scratch (if, for example, the optimal bases 
are retained, this greatly increases storage 

requirements) ; 

4 . No use is made of LP parametric programming 
techniques . 

When taken together, it appears that these factors 
detract from the computational efficiency of the algo- 
rithm. 

In a recent study, Nauss [49] considered the 
parametric integer linear programming problem (PILP) for 
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the following two cases: 

1. Parameterization over a finite number of 
points ; 

2. Varying one parameter continuously over a 
specified range. 

The general (PILP) is proposed as 
MIN (c+f k )x 

(PILP) s.t. (A+D k )x > b +r k (V.25) 

x > 0 and integer; k=l , 2 , . . . ,k* . 

The solution of PILP is based on the branch and bound 
algorithm, modified by "problem dependent techniques" 
and "solution priorities" to be discussed below. The 
ultimate application of Nauss' analysis is to each of 
three specific PILP problems: the generalized assign- 

ment problem, the 0-1 knapsack problem, and the capaci- 
tated facility location problem. 

Initially, a rudimentary and general algorithm 
is given for the solution of (V.25). Defining 

P k R = PILP indexed on k with implying a given 
restriction on the problem at node i (R q is 
the original constraint set) . 

The method proceeds as in the branch and bound algorithm, 
but all k* problems are, in general, considered at every 

node. If a fathoming condition is encountered for the 
t h 

k problem, consideration of this problem may be 
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deleted at successor nodes. Several open questions exist 
in connection with the PILP algorithm, among them 

1. The form of the relaxation of P, to employ, 

K , 

2. Any initial setup of PILP to take maximum 
advantage of problem structure, 

3. The branching rule to invoke, and how to 
apply it to the remaining problems at a 
node . 

No specific procedures are advanced to answer these ques- 
tions . 

Nauss proposes several problem dependent tech- 
niques to improve algorithmic efficiency for specific 
problem structures. The first of these is "problem reduc- 
tion," attained by pegging as many variables as possible 
to their optimal solution values and eliminating con- 
straints found to be dominated by others. This result is 
most powerful when the reduction is applicable to all 

problems of the PILP. "Feasibility recovery" involves 

t h 

the use of an existing optimal solution to the k prob- 
lem to generate a feasible solution (and upper bound) for 
s t 

the k+1 problem. Third, "bounding problem reoptimiza- 
tion" is indicated in algorithms using LP as the relaxa- 
tion. The author proposes that LP parametric program- 
ming techniques such as those discussed in Section V.2 
might be a way to solve the relaxations of all problems 
of the PILP. Finally, the author suggests "wide range 
bounding" as a technique involving the use of the 
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Lagrangean dual problems to generate upper bounds for 
the problems of PILP. 

Solution priorities deal with the distribution 
of effort in solving the PILP; that is, how we should go 
about solving the k* problems. The "purely serial" 
approach calls for solution of one problem to optimali- 
ty, then gleaning as much information as possible from 
it to assist in solving the next problem, and so on. 
Information such as the separation of variables, upper 
and lower bounds, fathoming conditions, etc., can be 
helpful, assuming the course of enumeration for the next 
problem is similar to that of its predecessor (note that 
Roodnan's algorithm is an example of the purely serial 
approach) . The "lexicographically serial" approach is 
based on the continued solution of problem 1 to those 
nodes where a fathoming condition is encountered for it, 
then switching to problem 2, etc., until all k* prob- 
lems are considered. The final solution priority, the 
"parallel" approach, has all k* problems, each with the 
same , under consideration at every node. When a 
fathoming condition for problem k is met at a node, the 
problem is not considered at successors to that node. 
Ultimately, when all problems remaining at a node have 
been fathomed, the node itself is fathomed. Nauss states 
that, "the parallel approach ... relies on the assumption 
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that solving a series of closely related problems at a 
given node can be accomplished relatively efficiently." 
[49, p. 38] He recommends the first two priorities over 
the parallel approach. 

An interesting result due to Noltemeier [51] and 
cited by Nauss is given as Theorem 3 : 

Theorem 3 : (Noltemeier) Given that c, A, and 

b are integer quantities, then the 
PILP parameterized in the require- 
ments vector only may be transformed 
into an equivalent PILP of the form 
MIN cx 

s . t . Ax > b + t k ( V . 2 6 ) 

x > 0 and integer 

where t^ is an integer vector con- 
formable with b for k=l , 2 , . . . , K* . 

As a result of this theorem, continuous parameterization 
of the requirements vector may be reduced to the discrete 
case for the PILP. This result is not true, in general, 
in the parametric mixed-integer linear programming case. 

The work of Nauss is a major effort in parametric 
programming. Little attention is given, however, to the 
mixed-integer case. In addition, several open questions 
pertaining to algorithmic implementation remain. These 
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questions were answered for the three problem forms 
mentioned earlier, but not for general PILP. As a final 
note, Nauss also recognized the difficult combinatorial 
problem implied by simultaneous parameterization of pro- 
gram data with different parameters, but did not recom- 
mend procedures to alleviate the difficulties. 

V.4 A Proposed MILP Parametric Programming Algorithm 

We now propose an enumerative algorithm for solv- 
ing the following parametric mixed-integer linear program 
(PMILP) . 

MAX c^x + c 2 y 

s.t. A^x + A 2 y = b^. (V.27) 

(PMILP) x > 0^ and integer 

y _> 0_ 

k=l , 2 , . . . ,K* 

We call the individual problems of (V.27) (P^) / and the 
LP relaxations of these problems (RP^.) • A parallel solu- 
tion approach will be taken. Further, we assume that as 
the index k increases, the problems of the PMILP become 
more restrictive. Note that the form of (V.27) implies 
that the continuous parameterization of b has been re- 
duced to the discrete case. In the absence of the result 
of Theorem 3, the discretization is certainly arbitrary. 
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and as such may be chosen in accordance with the problem 
setting . 

In our modified branch and bound algorithm, we 

wish to consider (RP^) for all k at any node j. Let 

1c k t h 

(x j , y j ) be a solution to the k relaxed problem at 

node j. Then if we write the constraint set of (RP^) J 

(including additional restrictions due to branching) as 

T k and that of (P, ) 3 as S k , it is immediate that 
3 k 3 

(Xj, y k ) 6 . Tj => (x k , y ^ ) € S k , (V.28) 

since, due to the nature of the LP relaxation 



We can write 





(V . 29 ) 



z (k, y) = c^x + c_ 2 y , 



(V . 30 ) 



from which we can infer the following relationships for 
our problem: 



Z* 

3 



k 



"z(x* k , y* k ) if (x* k , y* k ) is optimal to (P, ) 3 
3 3 3 3 k 



k -ooif S 



0 



oo if (P ) ^ is unbounded 



Due to (V.28) and (V.29) we can obtain an upper bound, 

Z . k , on Z* k by solving (RP, ) ^ to yield 
3 3 k 



